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LETTER FROM THE EDITOR 


We Did It Again — 
Only Better This Time 


ast year, Xilinx debuted at #14 on the FORTUNE 100 Best Companies to Work For™ 
i gannual list. This year, we broke into the Top 10 with a ranking of #6 — the only Silicon 
Valley/San Francisco Bay Area company to make the Top 10. 


To help determine the new rankings, FORTUNE weighed how companies reacted to the economic 
downturn of 2001. Although revenue at Xilinx declined by 50 percent over the course of 2001, the 
company decided employee layoffs would come only as a last resort. As a result of significant cost- 
cutting measures — including tiered salary cuts for everyone except the lowest paid employees — Xilinx 
managed to avoid layoffs, continued to bring remarkable new products to market, and became larger 


than all other public programmable logic companies combined. 
We believe the FORTUNE magazine ranking reflects the strong values we hold at Xilinx: 


¢ Customer focus 


es “MANY XILINX EMPLOYEES HAVE TOLD ME HOW PROUD THEY ARE 10 
ae RECEIVE THIS RECOGNITION. 1T SPEAKS AS MUCH ABOUT THE GREAT 
CULTURE WE HAVE CREATED AT XILINX AS 1S DOES ABOUT THE QUALITY 
OF PEOPLE WHO WORK HERE. 


¢ Accountability 
¢ Teamwork 

° Integrity 

¢ Very open communication — WIM ROELANDTS, CEO, XILINX INC. 
¢ Enjoying our work. 


These CREATIVE values are a vital part of our culture, and they make a positive difference for our 
employees and our customers. Our values make Xilinx a great company to work for — and a great 
company to work with. Our technology, our service, and our customer satisfaction continue to lead 
the industry. We are always seeking new ways to make a real difference in our world. And, though 
technology drives our business, we've built our success on the strong partnerships we've created with 


our customers, our suppliers, and other technology leaders in our industry. 


To learn more about the FORTUNE magazine's top 100 companies to work for, go to: 


www.fortune.com/lists/bestcompanies/. 


Din Boivin. 


Tom Durkin 
Managing Editor 





How to Survive a Recession 
and Gain Market Leadership 


“| said from the beginning that we wanted to emerge trom the 
a recession a greater company — and | think we have achieved that. 


Perspective | Page S 3) 
When Total Cost Management —_— 
Counts, Xilinx PLDs Pay Off 


CoolRunner-II CPLDs, Spartan-IlE FPGAs, and Virtex-II EasyPath 
Plattorm FPGAs can give you total system cost solutions superior 
to those of traditional ASICs. 





" 






Exploring Hardware /Sottware 
Co-Design with Virtex-Il Pro FPGAs 


To explore the possibilities of hardware /sottware co-design, 
the Xilinx Design Services team created a reference design for 
a real-time operating system for telecommunications. 





Technology Focus 


Customize Virtual Private Networks 
with Spartan-IIE FPGA Solutions 


SpartanllE FPGA solutions can help you to customize your virtual private 
network for highest performance, greatest security, and lowest cost. 









- How to Build an Efficient FIR Filter 
Using System Generator 


Xilinx System Generator 2.2 software enables high-level modeling 
and implementation of DSP systems in Xilinx Platform FPGAs to 
outperform traditional DSP processors. 








New Products 


Develop MicroBlaze Applications with Three i . f 
Flexible Hardware Evaluation Platforms Seas 


MicroBlaze Development Kits trom Memec Design demonstrate the 
versatility of the MicroBlaze soft processor core in a variety of Xilinx FPGAs. 








Reference 





CoolRunner-Il Data Sheet 


RealDigital CPLDs consume only microwatts of power, yet provide 
very high pertormance — take a look at what’s “under the hood.” 
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by Wim Roelandts 
CEO, Xilinx, Inc. 


Last year was a 
very difficult year 
for the semicon- 
ductor industry. 
In fact, it was the 





worst recession 
ever in the history of semiconductors. In 
1985, the industry decreased in a year- 
over-year basis by 17% to 18%. In 2001, 
the decline was 32% to 33% — almost 


twice as bad as 1985. 


Nevertheless, I believe that it’s in the 
recessions that the good companies dis- 
tance themselves from the not so good 
companies. Even though our business was 
very weak, we gained market share during 
this recession in quite a dramatic way. 
Last quarter, for the first time in our his- 
tory, we became larger than all the other 
public programmable logic device compa- 


nies combined. 


From a strategic point of view, we want to 
win through product leadership. That 
means we have to have the best products — 
always from the customer's point of view — 
which means the highest densities, the 
fastest performance, and the software and 


service to support them. 


When you are an industry that is new 
and that is still changing very rapidly, 
you have to continue to innovate. You 
have to continue to bring new products 
into the market. 


Despite the recession, the last part of 
2001 and the early part of 2002 was one 
of our most productive periods for new 
product introductions. In about five 
months time, we introduced a new 
family in every one of our product cate- 
gories — Spartan™, CoolRunner™, and 
Virtex™. Each, in its own way, is unique 
in its capabilities. In fact, two of these 
new products — the CoolRunner-II 
Realicital CREW) and the. ¥ irtexall 
Pro™ Platform FPGA — have no compe- 
tition. Literally, there are no comparable 
products on the market that can compete 
directly with them. 
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Total Solutions 


I’m very excited about the programmable 
logic industry. It’s an industry that’s still in 
the early phases, but over time, it will 
become the dominant way to implement 


integrated circuit designs. 


The inherent flexibility of programmable 
logic allows you to gain time to market, to 
introduce new products very quickly, and 
to upgrade your product, even after the 


sale, through field-upgradeable technology. 


We're not just focusing on the chips, how- 
ever. Our chips must also have the soft- 
ware, the intellectual property cores, the 
training, and the support to deliver total 


solutions to our customers. 


And we have to continue to be innovative 
in all these areas of IC technology. My 
whole belief is that Xilinx must excel in a 
continuous push to innovation — through- 
out the company, be it in chip design, soft- 
ware development, or marketing. 
Innovation is absolutely critical — especially 


in times of recession. 
No Layoffs — Not Just a Nice Thing to Do 


This recession has put a tremendous strain 
on the industry. When I saw this decline 
coming, I concluded that it would be best 
for us if we could avoid layoffs — even 
though the economic consequences would 
be very difficult. I wanted to avoid layoffs, 
because once you start layoffs, people get 
worried — and if people are worried, they 


cannot innovate. 


Innovation means that you have to take risks 
that some things are not going to work, and 
therefore, if you are worried about your job, 
the mantra becomes, “Don’t rock the boat. 
Dont push yourself or make yourself too 
visible” — which is totally opposite from 


what we want to see happening. 


If people are worried about their jobs, 
clearly, they're not going to be as produc- 
tive. Layoffs affect not only the people who 
get laid off. Very often the people who are 
still there go through trauma, because they 
see their friends disappear — and they 
always have this fear: “Maybe next time it 


will be me.” 


Whenever you lay off employees, some of 
the accumulated knowledge of the com- 
pany — its “intellectual capital” — is going 
to walk out the door. Especially in the 
high-tech industry, the value of the com- 
pany is in the heads of its engineers and 
support staff. At Xilinx, 70% or 80 % of 
our employees are highly educated people 
working in engineering and marketing 
and supply-chain management. They are 
all experts and specialists in their own 
way — and that’s why it’s so important to 
keep these people happy at Xilinx. Once 
people like that leave, that intellectual 
capital goes out the door with them, and 
that’s a lot of value for a cutting-edge 
company to lose. 


Several compelling studies have proved that 
if you go back two years later, the compa- 
nies that laid off employees are generally in 
worse shape than companies that didnt. 
Layoffs have a very profound impact on the 
morale and the trust between management 
and employees. Furthermore, a lot of 
know-how has left the company. And it 
takes a long time to get it back. 


For these reasons, and more, I decided to 


try to avoid layoffs. 
Pay Cuts — An Innovative Approach 


The reality of economics is that when busi- 
ness drops, you have to bring your labor 
costs in line. With layoffs not being an 
option, the way we cut our labor costs was 
by a very innovative program where we had 
a tiered salary reduction that was propor- 
tional to the size of each worker’s salary. 
The lowest paid people had no reductions, 
and the highest paid people had the biggest 


reductions (myself included). 


In implementing this policy, we discov- 
ered that we should give people some 
choices and make sure they get something 
in return. People had the choice to either 
take the salary reduction or to take vaca- 
tion time — or take stock options. Stock 
options arent money, but it is compensa- 
tion for the work people put in. We can't 
ask the employees to sacrifice unless we 
can compensate them for the extra flexi- 


bility we’re asking from them. 
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The tiered salary reduction was extremely 
well received in the company. The vast 
majority — and I’m talking about close to 
100% of the employees — felt this was the 
right thing to do. 


People even volunteered for the plan, which 
was important in Europe, because in 
Europe, companies cannot mandate salary 
reductions. It must be done on a voluntary 
basis. If people hadn't volunteered, we could 
not have done it, but pretty much all of our 
European employees volunteered for the 
salary reduction, because they felt solidarity 
with the company. 


With an acceptance of fair pay cuts and a 
promise of no layoffs, our employees felt 
safe to continue to risk and innovate. They 
have kept their attention full-time on their 
jobs. The net result has been that we have 
been able to maintain our aggressive sched- 
ule of new product rollouts — and this in a 
period when our business was off by 50% 
in three quarters. 


Moreover, we were not only able to remain 
in the black from a profit point of view, but 
during all this period, we were also able to 


remain cash-flow positive for the company. 
The Importance of Being “Fabless” 


One of the reasons we could afford to stay 
in the black is because Xilinx is a “fabless” 
semiconductor company. We don't own 
fabs [fabrication plants]. We don't have this 
big problem of having a fab that we need to 
keep busy. For us, if business goes down, 


we just buy fewer wafers. 


The same thing is true for our sales force. 
We don't have a Xilinx sales force. We 
work through distribution and sales reps, 
which again means that our sales costs are 
by and large variable. Sales people get 
paid a commission. If they don’t sell, 


there’s no commission. 


These two factors helped us tremendous- 
ly in remaining in the black and remain- 
ing cash-flow positive, because a lot of 
our expenses go up and down with the 


business climate. 


The good news is that we reached our 


lowest quarter in September of 2001. 
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Since then, business has been starting to 
improve. We were able to eliminate salary 
reductions early in 2002, and as of this 
moment, all salary reductions have been 
eliminated. The company continues to do 
very well from both a new product gener- 
ation point of view, as well as a business 


point of view. 
Innovation — Not Just for Engineers 


Innovation is not just the engineering 
department. Even in the way we manage 
the company, we are very innovative — 
and it really has paid off for us. Morale is 
very high, and the attrition rate is 
extremely low. People have kept their 
attention on their jobs of designing great 
new products and marketing them and 


supporting them. 


In general, I think Xilinx is very much an 
admired company in our industry. I think 
we are perceived as a leader in our industry 
— not just from a product point of view or 
a financial point of new, but also from a 


management point of view. 


I want to stress, however, that all of this is 
possible because we are a fabless company. 
If we had our own fabs, I don’t know if we 


could have done this. 


Nevertheless, as a fabless company, our 
business model lends itself very well to go 
though these economic cycles that are 
infamous in the semiconductor industry. 
This last recession was the worst, but it 
also served to prove that our business 
model is a very, very solid model that 
allows us to go through these ups and 


downs without too much hurt. 
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Conclusion 


In high technology, things change very, 
very rapidly. And companies often get in 
trouble because they don’t change quickly 
enough. One of the reasons they don't 
change quickly is that the people who see 
the change happen don’ act on it, and the 
people who can act on it dont even know 


the change is happening. 


Therefore, what we are creating at Xilinx 
is an environment where everybody says: 
“Hey, if it changes, let’s change with it. 
Let’s anticipate change. Let’s make the 
change happen ourselves. Let’s be in 
charge of the change — not the victim of 


the change.” 


It is this kind of thinking that got us through 
the recession, and now that we are emerging 
from the recession, there is a tremendous 
sense of pride in the company. People are 
saying: “Hey, this is unique. We have done 
something that very few other companies 
have done. Even companies that didn't lay 


people off didn’t do it the way we did.” 


There is absolutely no doubt in my mind 
that there is a much, much higher spirit of 
belonging, of community within the 
Xilinx people than there was before. When 
you go through a tough time together, it 


creates solid bonds. 


And it shows that, indeed, if you can man- 
age a company during a recession in a cor- 
rect way, you can really differentiate your- 
self from the competition. You can gain 
market share and emerge a stronger com- 
pany with a creative, dedicated — and intact 


— workforce at the end of recession. & 
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by Xilinx Statt 
IBM and Xilinx recently signed an agree- 


ment that could help you shave hundreds of 


thousands of dollars off the cost of creating 
custom chips. Under the agreement, IBM 
has licensed FPGA technology from Xilinx 
for integration into IBM’s recently 
announced Cu-08 application-specific inte- 
grated circuits (ASICs). Cu-08 will support 
circuits as small as 90 nanometers — less than 
1/1,000th the width of a human hair. 


This news underscores our joint commit- 
ment to a technology relationship aimed at 
bringing innovative and flexible new 
“hybrid” chips to market, combining the 
best attributes of standard ASIC and flexi- 
ble FPGA technology for use in communi- 


cations, storage, and consumer applications. 


Engineers working on complex chip designs 
have been clamoring for ways to achieve high 
levels of integration, yet still have the ability 
to change “on the fly” late in the design cycle. 
By combining FPGAs (circuits that can be 
configured to perform a wide variety of digi- 
tal electronic circuit functions) with standard 
ASICs, you get the flexibility of the FPGA, 
with the density, performance, and overall 


cost advantages of an ASIC — all on one chip. 


“Savings here could be dramatic,” said 
Michel Mayer, general manager, IBM 
Microelectronics Division. “When an 
ASIC takes on more function, you can 
reduce cost by eliminating one, two, or 
even more separate chips. With this tech- 
nology, customers would be able to tweak 
designs and integrate new changes imme- 
diately, eliminating the need to restart a 
whole new design cycle, bringing tremen- 


dous time-to-market advantages.” 
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Mayer said changes that force an addition- 


al chip prototype can easily cost hundreds 
of thousands of extra dollars and can 
stretch design cycles out for several addi- 
tional months. “This approach is expected 


to change the landscape entirely,” he said. 


Cu-08, with as many as eight layers of 
copper wiring separated by an advanced 
low-k insulation, will support up to 72 
million gates for high-complexity IC 
solutions. As many as 400,000 ASIC 
gates may be dedicated to one or more of 


the FPGA cores on the ASIC. 


“We've improved upon our delivery of pro- 
grammable hardware by allowing reconfigu- 
ration using the same chip. Flexibility is the 
beauty of combining ASIC and FPGA tech- 
nology,” said Wim Roelandts, president and 
CEO of Xilinx. “Our latest agreement with 
IBM is a natural extension of and a signifi- 
cant milestone in our existing relationship 
that allows us to address new opportunities 


in the high-end ASIC market.” 


Even after this new custom chip is built 
into a product, you can add to its func- 


tionality, simply, easily, and effectively. A 


IBM Partnership 


IBM ASICs Will Incorporate 
Xilinx FPGAs 


New ASICs trom IBM will contain 
Xilinx programmable logic technology. 


Perens es 


good example would be in the communica- 
tion industry, where various protocol and 
interface specifications are constantly evolv- 
ing. Cell phones, printers, set top boxes, 
and other consumer electronic products 


also are well suited for this new approach. 


IBM is the number one ASIC manufacturer 
worldwide. Xilinx is the number one PLD 
manufacturer worldwide, with more than 
15 years experience in FPGA design tools. 


IBM and Xilinx already have a productive 
relationship. IBM 
embedded PowerPC™ processors for 


Xilinx Virtex-I] Pro™ FPGAs. In March 
of this year, Xilinx signed a high-volume, 


supplies 


business 


multi-million dollar manufacturing agree- 
ment with IBM to manufacture these 
FPGAs using IBM’s advanced 130 
nanometer and 90 nanometer copper- 


based chip-making technology. 


The new FPGA cores for the Cu-80 
ASICS are now in development. They are 
expected to be available from IBM, 
embedded in an ASIC, in early 2004, fol- 
lowing IBM’s full release of its standard- 
setting Cu-08 technology. & 
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Perspective Cost Management 


When Total Cost 
Management Counts, 
Xilinx PLDs Pay Ott 


CoolRunner-ll CPLDs, Spartan-IlE FPGAs, and Virtex-ll EasyPath 
Platform FPGAs can give you total system cost solutions 
STN MOMINON ROM IUOTIOI UMW) Os 


by Eric Thacker 

Manager, Product Solutions Marketing 
Xilinx, Inc. 

eric. thacker@xilinx.com 


Rapidly changing markets and new tech- 
nologies are subverting traditional system 
design paradigms. Historically, in electronic 
systems development, the standard design 


procedure was to prototype a system using 
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programmable logic devices (PLDs), and 
then redesign the system using ASICs as 
soon as possible for cost reduction. The 
idea was to leverage the flexibility and 
development time advantage of program- 
mable logic until the design stabilized and 
then to convert the design to a lower-unit- 


cost ASIC to reduce overall system cost. 


In the past, programmable logic was lower 
density, lower performance, and more 
limited in capabilities. When systems fol- 
lowed traditional product life cycle behav- 
ior, the PLD-to-ASIC design strategy 
worked quite well. In fact, in some market 
segments, characterized by relatively 
mature technologies, established stan- 
dards, and a stable competitive environ- 


ment, this strategy is still optimal today. 


In dynamic, rapidly changing markets, 
however, this strategy becomes question- 
able. For many advanced electronic prod- 
ucts — such as plasma televisions, home 
networking hardware, and _ digital 
audio/video equipment — the market is 
anything but stable. Baseline standards 
for advanced products are ever-changing, 
features demanded by customers shift 
constantly, new technological capabilities 
are continually being introduced, and 
competitors are always trying to gain a 
better market position — resulting in 


many short-lived products. 


These volatile market pressures give a sig- 
nificant competitive advantage to flexible 
and first-to-market products. In these 
markets, ASICs — with their long lead 
times, high initial costs, high minimum 
order quantities, higher risks, and inflexi- 


bility — are not a compelling solution. 


Therefore, many designers are tapping 
into programmable logic’s benefits of 
instant reprogrammability, short lead 
times, greatly improved performance, 
expanded densities, and overall flexibility. 
Indeed, today’s programmable logic 
devices offer many of the same features 
found in ASICs. Clock management, 
embedded processors, high-performance 
DSP, advanced interfacing, and a pletho- 
ra of intellectual property (IP) cores all 
make today’s PLDs strong candidates for 


a systems core logic. 


With their emphasis on cost-optimiza- 
tion, the CoolRunner!™-II CPLDs, 
Spartan™-IIE FPGAs, and Virtex-II 
Easy Pathan nn ane 
Xilinx bring these advanced benefits to 


your high volume applications. 
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Opportunity Costs 


Inventory Costs 


Development Costs 


Total Cost 


Unit Costs 
NRE 


ASIC 


Figure I - Cost comparison of ASICs and FPGAs 


Total Cost Solution 


In the past, the primary motivation in con- 
verting a PLD design to one based on an 
ASIC was unit cost. ASICs, in general, are 
available at lower unit cost than PLDs. But 
looking at unit costs alone could actually 
lead to an overall higher cost solution. ‘That 
is because unit costs are only one of the fac- 
tors that contribute to total system cost. 
Total system costs can be broken down into 
three broad categories — design and develop- 


ment, production, and life cycle. 
Design and Development 


Design costs are primarily worker-hour and 
non-recurring engineering (NRE) costs. 
PLD designs require fewer worker hours, 
because design/debug changes can be made 
instantly, hardware and software can be 
developed concurrently, test vector genera- 
tion is unnecessary, and product lead times 
are practically zero compared to ASICs. 
PLDs have no NRE costs, which can be a 
very important consideration as production 
volumes per design revision are much lower 
in rapidly evolving markets. In many cases, 
Xilinx offers free PLD design tools, thus 


increasing the PLD cost advantage. 


There are also lost opportunity costs associat- 
ed with ASICs. Systems using ASICS lose 
market penetration due to long development 
times. Furthermore, the inflexibility of ASIC- 
based systems prevents upgrading products to 
meet changing market needs. Overall, PLD 
design cost advantages alone can often out- 


weigh any ASIC unit cost advantage. 
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FPGA 


Production 


Unit cost is the most 
obvious cost in this cat- 
egory. ASICs usually 
do hold the unit cost 
advantage over PLDs, 
but this cost differen- 


tial is shrinking. 


Today’s 


ageressively 


Development Costs 


PLDs _ have 


driven 


Unit Costs 


down both die size and 
package costs to provide 
a persuasive ASIC alter- 
native. The advanced 
system capabilities of 
modern PLDs allow 
designers to integrate discrete component 
functions into the PLD — reducing board 


and total component costs. 


Finally, unlike an inflexible ASIC, one line 
of PLDs can be inventoried to supply mul- 
tiple applications. You can reprogram and 
redeploy PLDs as needed. Combined with 
much lower minimum order quantities, 
PLDs reduce inventory costs, as well as the 
risk of obsolescence, further offsetting 


PLDs’ unit cost disadvantage. See Figure 1. 
Life Cycle 


Life cycle costs are probably the least consid- 
ered costs when designing an electronic sys- 
tem, but they can have a significant impact 
on the total return from a given design. 
Using PLDs instead of ASICs means practi- 
cally no risk of obsolete inventory, because 
standard-product PLDs can 
be redeployed to different 
applications as _ needed. 
Unlike ASICs, PLDs do not 
have to be scrapped if a spe- 


cific product is cancelled. 


In addition, because PLDs 
can be reprogrammed, it is 
possible to upgrade prod- 
ucts in the field simply by 
downloading a new hard- 
ware configuration file to 
the PLD. Bug fixes and fea- 
ture upgrades can be per- 
formed remotely without 


changing any hardware. 


Finally, long-lived legacy products using 
SUG sec anpiacem the anced more nostic 
redesign should the ASIC product or 
process be discontinued. PLD reprogram- 
mability avoids this problem. Thus, lega- 
cy products with PLDs do not siphon 
design resources from next-generation 
products, and this presents a significant 


market cost savings. 


As illustrated in the discussion of these 
three cost categories, we must look beyond 
unit cost when deciding on logic solutions 
based on cost. In many applications, the 
unit cost savings of ASICs are more than 
offset by the lower risks, design, produc- 
tion, and life cycle costs of PLDs. 


Xilinx Solutions for Total Cost Management 


Xilinx has taken the lead among PLD 
suppliers in providing low risk, low cost, 
high performance logic solutions for 
today’s rapidly changing and highly com- 


petitive markets. 


Low Power, Low Density: 


CoolRunner-II CPLDs 


CoolRunner-II RealDigital CPLDs com- 
bine high performance, ultra low power, 
low cost, and integration of specialized 
capabilities to bring you a new class of 
programmable logic devices. The versatil- 
ity and low cost of CoolRunner-II 
CPLDs allow them to serve a multitude 
of logic, interface, and control functions. 


See Figure 2. 





Figure 2 - CoolRunner-II CPLDs deliver high performance at low cost. 
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Figure 3 - Spartan-ILE FPGAs 


save board space — and money. 


The 0.18-micron CoolRunner-II family 
utilizes second-generation Fast Zero 
Power™ technology for low power, high 
performance operation. These RealDigital 
CPLDs feature an all-digital core that elim- 
inates power-hungry analog sense ampli- 
fiers. The CPLDs also offer increased capa- 


bilities, higher performance, and lower cost 


— with no performance penalty. 


Available in densities from 32 to 512 
macrocells, the CoolRunner-II family pro- 
vides performance over 330 MHz, with 
standby power consumption of less than 
100 pA. CoolRunner-II devices also deliver 
advanced system features that enable the 
integration of costly discrete system func- 
tions into a single reprogrammable device. 
This integration results in lower costs, fur- 
ther power reduction, increased reliability, 
faster time to market, and smaller designs. 
These system features include superior 
I/Os, advanced clocking features, and four 


levels of design security. 


The in-system reprogrammable nature of 
CoolRunner-II CPLDs allows designers to 
change designs on the fly, correct designs, 
and test out alternate designs without 
changing any hardware. This saves signifi- 
cant engineering change orders (ECOs) and 
NRE charges compared to ASIC solutions. 


The CoolRunner-II CPLDs advanced I/O 
interface capabilities include DataGATE, a 
programmable on/off switch for power man- 
agement; advanced interfaces such as HSTL, 


and SSTL; and Schmitt trigger inputs for 
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input signal conditioning. All of these fea- 
tures reduce the need for external compo- 


nents, reducing part costs and board space. 


To further reduce system costs and the need 
for _ additional components, the 
CoolRunner-II family provides a unique fea- 
ture called Cool CLOCK. CoolCLOCK is a 
combination of a clock doubler and clock 
divider. The incoming clock is divided by 
two and then doubled at the output to main- 
tain the same performance, while reducing 


the internal power consumption. 


To increase your cost management 
advantage, Xilinx provides a free Internet- 
based ISE WebPACK™ design tool suite 
and a free WebFITTER™ design fitting 
tool to give you a complete, pushbutton 


design solution. 


The combination of advanced interfacing, 
integrated system features, and free design 
tools makes CoolRunner-IJ CPLDs a com- 
pelling choice for logic functions in cost- 
sensitive applications. For more informa- 


tion, visit www.xilinx.com/coolrunner2. 


Moderate Density, System Integration: 
Spartan-IIE FPGAs 


Cutting-edge consumer electronics markets 
demand products that provide performance 
and flexibility in a cost-optimized format for 
home networking equipment, home theater 
systems, and other high-end personal elec- 
tronic devices. These consumer products 
must accommodate rapidly evolving stan- 


dards, shifting demand patterns, highly fluid 
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competitive positioning, and the continuous 
introduction of new capabilities. Logic 
requirements for these designs include mod- 
erate density, advanced system features, pre- 
designed and verified IP blocks, and high- 
quality design tools. Spartan-IE FPGAs 


meet all of these requirements and more. 


Xilinx designed its next-generation Spartan- 
IIE consumer FPGA with total cost man- 
agement in mind. Spartan-I[E FPGAs 
range in density from 50,000 system gates 
to 300,000 system gates. To reduce FPGA 
component costs, Spartan-IE FPGAs are 
produced by the industry's most advanced 
foundry process on 300 mm wafers. This 
process lowers die cost and _ enables 


advanced, low-cost packaging. 


To reduce board cost and improve system 
performance, Spartan-IIE FPGAs have a 
number of built-in advanced system fea- 
tures that allow you to eliminate costly dis- 
crete translators and commodity chips 


from your boards, as shown in Figure 3. 


Advanced interfacing — including LVDS, 
LVPECL, HSTL, and SSTL — remove the 
need for specialized interface components 
on the board. Furthermore, Spartan-IIE 
FPGAs offer four delay-locked loops 
(DLLs) for advanced clock management. 
Spartan-HE FPGAs also contain both para- 
meterizable block RAM (single or dual 
port) and distributed RAM (scalable and 
adjacent to logic). All these features can 
speed the design process, provide design 
flexibility, reduce design worker hours, 
deliver very high levels of performance, and 


accelerate time to market. 


As consumer products increase their digital 
capabilities, digital signal processing (DSP) 
capability has become a critical performance 
factor. The Xilinx XtremeDSP™ initiative 
enables nearly one billion DSP multiply- 
and-accumulates (MACs) per second per 
dollar of performance in Spartan-IIE devices 
— delivering advanced DSP without the 


need for specialized board components. 


Xilinx offers more than 200 Spartan-family 
IP cores to speed logic design and reduce 
engineering resources needed to complete 


the system. These IP cores are also opti- 
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mized for die area, which may permit the 
use of smaller density Spartan-IIE parts, 


resulting in further cost savings. 


Additionally, the free ISE WebPACK soft- 
ware provides the industry’s most advanced 
design tool platform to support the entire 


Spartan series. 


Die and packaging, advanced system fea- 
tures, extensive IP offerings, enhanced flex- 
ibility — all these Spartan-IIE cost control 
features give you the performance, features, 
and time to market advantage necessary to 
compete in today’s rapidly evolving elec- 
tronics markets. For more information, visit 


www.xilinx.com/spartan. 


High Performance, High Density: 
Virtex-II EasyPath FPGAs 


The Virtex-II™ FPGA is the first embodi- 
ment of the Platform FPGA concept. 
Virtex-II devices deliver SystemIO inter- 
faces to bridge emerging standards, 
XtremeDSP performance up to 100X con- 
ventional DSP solutions, multi-gigabit 
transceivers, and Empower! embedded 
processor technology. Together with the 
industrys most advanced design tools, 
Virtex-II Platform FPGAs offer a feature set 
that is unparalleled in the industry. 


Virtex-II EasyPath FPGAs bring the 
unprecedented capabilities of the Virtex-II 
family to cost-sensitive applications by 
reducing the cost of high-density FPGA 
designs. Virtex-II EasyPath Solutions use 
the exact same FPGA silicon as the Virtex- 
IT FPGA family, but Virtex-II EasyPath 
Solutions make use of a specialized testing 
program to verify the FPGAs for a specific 
customer application, resulting in higher 


yields and significantly lower costs. 


Virtex-II EasyPath solutions provide an 
FPGA volume conversion strategy with no 
risk, no investment of customer engineering 
resources, and the fastest conversion time of 
any competing high-volume strategy for 
high-density FPGA designs. The main ben- 
efits of Virtex-II EasyPath solutions are: 


¢ Lower unit costs: Customers can expect a 
30-90% price reduction compared to 


standard FPGAs. 
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¢ Lower up-front costs: These costs can be 
as low as 30% of the NRE costs of an 
ASIC or gate array. 


¢ No added engineering resources: Unlike 
other FPGA conversion strategies, Virtex- 
II EasyPath FPGAs require no additional 
engineering resources or tools. Customers 
are not required to contribute any 
resources for conversion, verification, 


qualification, or board redesign. 


No risk: Virtex-II EasyPath Solutions 


have none of the performance, timing, 


functionality, or architecture match risks 
of alternative FPGA conversion strategies, 
because the conversion FPGA fabric is the 
same as the standard Virtex-I] FPGA. 


Minimal lead times: With Virtex-II 
EasyPath FPGAs, initial orders are filled 


within six weeks and restocking orders are 


filled within days of being placed. 
Compare that to lead times of multiple 
months for both initial and follow-on 


orders for alternative conversion strategies. 


Because Virtex-II EasyPath FPGAs are tested 
for a specific design configuration, testing 
throughput and yields are significantly high- 
er, thus driving down the unit costs of the 
FPGA, as shown in Figure 4. You can initial- 
ly design your system using standard Virtex- 
II FPGAs. Once the design has stabilized, 
you can use Virtex-II EasyPath Solutions to 


convert to a lower-cost platform with none 


of the risk, lower NRE costs, and none of the 
additional engineering resources required by 
conversion to ASICs. For more information, 


visit www.xilinx.com/virtex2. 


Conclusion 


From low density, ultra low power 
CoolRunner-II CPLDs to mid-range, low 
cost Spartan-IIE FPGAs to high density 
Virtex-II EasyPath Platform FPGA 
Solutions, Xilinx presents a portfolio of 
products that can play a critical role in your 
total cost management of system design. 


These advantages include: 
¢ Cost-optimized silicon and packaging to 
minimize unit costs 


e Advanced system features to reduce costs 


through system integration 


¢ Reprogrammability to streamline design 


and debug 
¢ No-cost system upgrade capabilities 
¢ Reduced risk of obsolete inventory 


e An extensive IP library to improve design 


efficiency and reduce system components 


¢ Free design tools that maximize design 


efficiency. 


Together, Xilinx CPLDs and FPGAs give 
you a cost-optimized solution for managing 
system costs while maximizing system per- 


formance, functionality, and flexibility. & 





Figure 4 - Virtex-II EasyPath Solutions are design-specific and cost-effective. 
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In a significant move, Xilinx recently 
expanded the Virtex-II Pro™ Platform 
FPGA family two ways: 


e Five new devices offering increased 


capability 


e Lower cost versions for the entire 


family in a new reduced-speed series. 


With this, the new Virtex-II Pro family 
expands to ten devices, and sets industry 
records in every conceivable category, pro- 
viding the advanced system-level features 
and high performance you need, at 


remarkably low prices. 


The new Virtex-IIJ Pro FPGA solution, 
illustrated in Figure 1, provides every- 
thing you need, on a single, reprogram- 


mable device, including: 


e The world’s highest logic and 


memory density 
¢ The world’s highest I/O capacity 
¢ The world’s fastest DSP capability 


¢ The world’s fastest multi-gigabit serial 
I/O connectivity 


¢ The world’s most flexible parallel 
I/O connectivity 


¢ The world’s only embedded processing 
FPGA solution using IBM® PowerPC™ 


processor cores. 
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Figure 1 - Virtex-II Pro Platform FPGA 


Your demands, both for increased capabili- 
ty as well as lower cost versions of our 
Virtex-II Pro devices, were made clear at 
our Programmable World 2002 event. And 
now, we can deliver the performance and 
the pricing you need due to the success of 
our initial devices and development tools, 
as well as advances in our process technol- 


ogy which includes 300-mm wafers. 


These new, lower cost devices not only deliv- 
er tremendous cost savings over comparable 
competing programmable device offerings, 
but they also bring Virtex-II Pro capabilities 
within the reach of a much wider range of 
applications. With the Virtex-II Pro family 
you now get multiple IBM PowerPC proces- 
sors and multi-gigabit serial transceivers at 
about the same cost as our Virtex-I]™ 
FPGAs (which don’t offer these features), 
effectively providing programmable system 


features at programmable logic prices. 


The Highest Performance and the Lowest Cost 


The new Virtex-II Pro XC2VP125 FPGA, 
shown in Table 1, is the world’s highest 
capacity, highest performance, and high- 
est capability programmable logic device, 
delivering unique system performance 
such as one trillion MACs/sec DSP per- 
formance and 120 Gb/s aggregate payload 
bandwidth. There is no programmable 
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device in the world that competes with it 
in features, performance, or cost. With 
such unparalleled leadership advantage, 
the Virtex-II Pro Platform FPGAs put you 
on a platform of success, closer to your 


design goals. 


age, digital broadcast, networking, and 
wireless networking. For example, in net- 
working, as you move from the core of the 
network to the edge, the line rate and pro- 
cessing speed of the equipment decreases 
while the need for intelligent processing 
increases. In addition, price pressure 
increases, because the edge equipment is 
more price sensitive than the core equip- 
ment. This analogy can be applied to 
almost any application and market seg- 
ment. Xilinx is providing -5 devices specif- 
ically to address such price-sensitive appli- 
cations that benefit from system features 
that include embedded processors and 


multi-gigabit transceivers. 
Virtex-II Pro FPGA Architecture Review 


The Virtex-II Pro family, shipping since 
February of this year, is the world’s first 130 
nm, 9-layer copper, FPGA family and the 
first and only one to offer integrated IBM 
PowerPC microprocessor technology and 
3.125 Gb/s serial transceivers. Its architec- 
ture is built upon the award-winning 
Virtex-II FPGA fabric, which already has 


thousands of satisfied users. 


“THE VIRTEX BRAND HAS BEEN THE NUMBER ONE CHOICE OF DESIGNERS WORLOWIDE, BASED ON ITS INDUSTRY-LEADING 

CAPACITY, PERFORMANCE, CAPABILITIES, AND COST-EFFECTIVENESS. IN EACH GENERATION OF FAMILIES, OUR 

MISSION 1S TO DELIVER SIGNIFICANTLY ENHANCED FUNCTIONALITY WHILE DRIVING DOWN PRICES. WE CONTINUE 

THIS STRATEGY IN THE VIRTEX-I] PRO FAMILY BY INCLUDING STATE-OF-THE-ART MULTI-GIGABIT SERIAL TRANS: 

CEIVERS AND HIGH-PERFORMANCE POWERPE RISC CPUS, IN THE STANDARD FEATURE SET, AT NO CHARGE. THUS, 

WE ENABLE THE CREATION OF NEXT-GENERATION SYSTEMS DESIGNS AT PREVIOUSLY UNATTAINABLE PRICE POINTS. 
ERICH GOETTING — VICE PRESIDENT AND GENERAL MANAGER OF THE XILINX ADVANCED PRODUCTS DIVIICN 


Expanding the Market and Target Applications 


The new -5 speed grade delivers cost- 
effective 2Gb/s transceiver performance, 
with over 266 MHz embedded PowerPC 
processor performance. In comparison, 
our fastest devices (-7 speed grade) 
provide up to 3.125 Gb/s transceiver per- 
formance, with over 300 MHz embedded 
PowerPC processor performance. The -5 
speed grade is therefore ideal for medium 


to high-speed applications including stor- 


With 10 devices, multiple packages, and 
multiple speed grades to choose from, the 
Virtex-II Pro FPGAs give you the right 
features and performance to meet your 
specific needs. The family, as shown in 
Table 1, includes: 


¢ Immersed IBM PowerPC processors (0 to 4) 
¢ Integrated 3.125 Gb/s Rocket I/O™ 


transceivers (0 to 24) 


¢ The award-winning Virtex-II] FPGA fabric. 
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IBM PowerPC Processors 


Each immersed IBM PowerPC processor 
runs at up to 300+ MHz and 420+ 
Dhrystone MIPS providing high-perform- 
ance embedded processing. The PowerPC 
processor is supported by the IBM 
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CoreConnect™ bus technology. A com- 
plete set of embedded software tools are 
available to help you quickly develop and 
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OEM agreement, Xilinx provides software 
tools from Wind River Systems, cus- 
tomized for Virtex-II Pro FPGAs. 


“WITH THE IMMERSION OF POWERPC CORES, XILINX 
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Figure 2 - Expanding the market from ultra-high speed to medium-high speed applications 


HAS DEVELOPED THE OPTIMAL PLATFORM TO [MPLE- 
MENT OUR STORAGE NETWORK DESIGNS. VIRTEX-lI PRO 
DEVICES PROVIDE THE CRITICAL ELEMENTS OF SYSTEM- 
LEVEL DESIGN, INCLUDING HIGH-SPEED SERIAL 1/0, 
MAKING IT AN IDEAL COMBINATION FOR QUR NEXT 
GENERATION PRODUCTS. MOREOVER, THE UNIQUE 
ARCRITECTURAL SYNTHESIS DESIGN ENVIRONMENT 
ALLOWS US 10 MAKE HARDWARE AND SOFTWARE 
TRADEOFFS THROUGHOUT THE DESIGN CYCLE. 


Rocket I/O Multi-gigabit Transceivers 


The Rocket I/O™ multi-gigabit trans- 
ceivers (MGTs) are based on the 
Mindspeed SkyRail™ CMOS technology. 
Each transceiver runs at rates ranging from 
622 Mb/s to 3.125 Gb/s, and includes the 
entire transceiver support circuitry. Each 
Rocket I/O transceiver consists of both a 
digital Physical Coding Sublayer (PCS), as 
Media 
Attachment (PMA), to provide an integrat- 


well as an 


analog Physical 


ed SerDes (serializer/deserializer) function. 
Award-winning Virtex-II Fabric 


The Virtex-I] FPGA fabric provides fea- 


tures for high-performance system design 


SANDY FELTON — EXECUTIVE VP AND CTD, SAN VALLEY 


Device XC2VP2 XC2VP4 XC2VP7 XC2VP20 XC2VP50 


| __ogiccas | 3,168 | 6,768 | 11,088 | 20,800 | 30,816 | 43,632 | 53,136 | 74,448 | 99,216 
| ___—Block RAM (Kbits) | 216 | 504 | 792 | 1,584 | 2,448 | 3456 | 4,176 | 5,904 | 7,992_ 
| _N8xI8 Multipliers | 12 | 28 | 44 | 86 | 136 | 192 | 232 | 328 | 
[Digital lock Management Blocks | 4 | 4 | 4 | 68 | 8 | 8 | 8 | 8 | 12 | 
| Configuration Memory (Mbits) | 1.31 | 3.01 | 449 | 8.21 | 11.36 | 15.56 | 19.02 | 25.6 | 33.65 
PST er rose PSO | Tn | VE a a 
| __Muli-Gigahit Transceivers | 4 | 4 | 8 | 8 | 8 | 12 | 16 | 2% | 2 | 
| __Mox Avoilable User /O | 204 | 348 | 396 | 564 | 692 | 804 | 852 | 996 | 1164 | 1200 | 
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Table 1- Virtex-IT Pro Plasform FPGAs Power of Choice “Note: FF1148 and FF1696 ars support higher 
(new -5 devices shown in red) user 1/0 and zero Rocket |/0 multi-gigabit transceivers 
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that has made it a platform of choice 


for users. These features include: 


¢ High-performance logic with a wide 


choice of densities 
¢ Embedded block RAM 
¢ XtremeDSP embedded hardware 


multiply circuitry 


e Advanced digital clock management 


circuitry 


e XCITE technology for digitally con- 


trolled impedance matching on I/Os 


e Bitstream encryption technology for 


design protection 


¢ Flexible Select I/O™-Ultra technology 
for supporting over 20 single-ended and 
differential I/O signaling standards. 


Conclusion 


The expanded Virtex-II Pro family pro- 
vides a broad choice of 10 devices in scala- 
ble features and speed grades. It marks 
leadership in every facet, providing more 
capability than any other competing pro- 
grammable logic device. In addition, the 
new pricing delivers programmable system 
features at programmable logic prices, 
making embedded PowerPC processors 
and multi-gigabit transceivers standard fea- 
tures in high-performance programmable 
devices. For more information on Virtex-I] 
Pro Platform FPGAs and design resources, 


go to www.xilinx. com/virtex2pro. > 
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LOWACROSCMICyGI Mame 
to Deliver Gigabit Advanced 
Data Security with Helion 

Encryption Cores 


Virtey-ll Platform FPGAs enabled Helion Technology Ltd. 
not only to deliver on time but to exceed the client's requirements. 
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increase, software implementations of the 
, key data encryption algorithms present a 
——_ major system bottleneck. To make matters 

worse, standard off-the-shelf CPUs and 
DSPs have failed to keep pace with the 


ar computational demands of data encryption 
algorithms. Besides, CPUs and DSPs just 

 —_— y have too many other tasks to perform. 
7 Ff me f The magnitude of the software perform- 


ance gap for high-speed data encryption is 






Fm. 
lz illustrated by the benchmark results shown 
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The Design Brief 


Last year, we at Helion were asked by one 
of our clients to provide a flexible, high 
performance FPGA-based data security 
engine to handle encryption of Internet 
Protocol (IP) packet-based data for their 
next-generation network security products. 
Even though their product was based on a 
high-performance dual 64-bit MIPS net- 
work processor, benchmarking had shown 
a software encryption solution to be some- 
what short of achieving the minimum 600 


Mbps throughput our client required. 


The system requirement was for a PCI- 
based engine that could accommodate 
multiple encryption algorithms: AES and 
a proprietary algorithm for legacy purpos- 
es. The solution had to be low cost, capa- 
ble of supporting gigabit data rates, and 
flexible enough to allow for future algo- 


rithm changes. 


After evaluating the available options, the 
only solution that met all of the require- 
ments was a PCI card containing the one 
million-gate Virtex-II XC2V1000 Platform 
FPGA. The device incorporated a Xilinx 
LogiCORE™ 32-bit/66-MHz PCI core 


and Helion-designed encryption cores. 
Why Use a Virtex-I] FPGA? 


From our many years of design experience 
with leading edge technology, we were 
well aware of the potential pitfalls, such as 
inadequate design tool support and 
unavailable parts. Fortunately, Virtex-II 
Platform FPGAs use the same EDA 
toolset as earlier technologies, and the 
XC2V1000 device was already in produc- 
tion. The reasons for choosing the 
XC2V1000 instead of the more estab- 
lished Virtex-E family were compelling: 


¢ Costs — The Virtex-II] XC2V1000 cost 
less than half the nearest equivalent 


Virtex-E(-8) device. 


¢ Performance — Benchmarking with the 
Helion AES core showed the Virtex-II(-4) 
FPGA to be 30% faster than the nearest 
equivalent Virtex-E(-8) device. 
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¢ Bigger and wider block RAM — We found 
32-bit wide data buffers were more effi- 
ciently implemented in Virtex-II block 
RAM. For example, a 512x32 buffer 
required only a single Virtex-II block RAM 
rather than four in Virtex-E technology. 


Enhanced CLBs — The MUXF7 and 
MUXE8 primitives allowed up to four 
slices to be combined for fast imple- 
mentations of any logic function up to 
eight inputs wide. This led to fewer 
logic levels and faster critical paths for 
the very wide logic functions typical of 


encryption algorithms. 


Digital Clock Manager (DCM) — True 
clock frequency synthesis allowed the 
encryption data clock to be tuned to 
closely match worst-case PAR (place- 
and-route) timing, optimizing encryp- 


tion throughput. 


Easier Design Implementation — The raw 
speed of Virtex-II devices meant no PAR 


guide file was required for the 
LogiCORE 66-MHz PCI bus logic. 


Encryption Engine Overview 


A block diagram of the encryption engine 
is shown in Figure 1. The only external 
interface is the LogiCORE 32-bit/66-MHz 
PCI bus. 


All encryption key information and data to 
be encrypted (plaintext) are stored in PCI 
system memory and transferred into the 


FPGA by the direct memory access (DMA) 


Solution Clock (MHz) 


Encryption Data Rate 
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controller. This configuration allows the 
encryption keys to be easily changed by 


software — an important security feature. 


The DMA read-and-write buffers use 
block RAM to support the long PCI burst 
read-and-write transfers needed to achieve 
optimum performance. Checksum calcu- 
lation and buffering is also provided for 
the encrypted data (ciphertext) to off-load 
this task from the system software, which 
reads the checksum(s) at the end of each 
DMA transfer. 


The main data path between the DMA 
buffers is through the encryption cores. The 
Helion 128-bit fast AES encryption core had 
already been developed, so it was simply a 
case of dropping it into the design. However, 
it was still necessary to develop a high-per- 


formance core for the proprietary algorithm. 
Encryption Engine Operation 


While the encryption engine is inactive (no 
DMA encrypt transfer in progress), the 
PCI interface defaults to “target” mode and 
waits for the system CPU to initiate a 
DMA transfer. Once a DMA operation is 
requested, the PCI interface is switched to 
“master’ mode, and the DMA controller 


initiates the required bus transfers. 


Upon completion, the encryption engine 
asserts the PCI interrupt request and 
returns the interface to “target” mode so 
that the system CPU can read the check- 


sums and/or start the next transfer. 


Comments 


TMS320€62XX 32-bit 
fixed-point DSP 
~~ 


MIPS-based 64-bit 
RISC processor 





Assumes fully primed cache and 
(PU fully dedicated to encryption 


Assumes fully primed cache and 
CPU fully dedicated to encryption 


Helion Fast AES Core 2 1536 Mbps Virtex-Il XC2V1000-4 target 


Table 1 - Typical achievable data rates for 128-bit Advanced Encryption Standard solutions 
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Because the plaintext data stream written 
into PCI system memory can be frag- 
mented into multiple IP packets, the sys- 
tem software is responsible for creating a 
control data structure that details the size, 
location, and number of fragments, as 
well as the encryption type to use. This 
structure is transferred from memory into 
a block RAM in the DMA controller at 


the start of each encrypt transfer. 


The encryption engine can then read in 
all plaintext fragments, encrypt them 
using the selected algorithm, and re- 
assemble the ciphertext fragments in 
memory. In the process of writing cipher- 
text back to memory, a checksum is cal- 
culated for each encrypted fragment, and 
this value is added to a total checksum of 
all fragments. These checksums are stored 
in a small distributed RAM in the FPGA, 
and then read back by the system CPU at 


the end of each transfer. 
Encryption Engine Performance 


Our client was delighted to learn that not 
only was the final encryption engine 
design delivered on time, but that it also 
exceeded the required data throughput for 
both encryption algorithms by a signifi- 


cant margin. 


The proprietary algorithm core is 
clocked at 66 MHz to achieve a raw 
encryption throughput of 990 
Mbps. The AES core is clocked 

at 132 MHz to yield a raw 
encryption throughput of 

1.536 Gbps. 


However, these raw 
throughput figures are 
degraded somewhat in in, 
the final system due to 
bus arbitration over- 
head in the PCI bridge. 
In fact, the data throughput of the 
Helion fast AES core is so high that the 
maximum available bandwidth on the 32- 


bit/66-MHz PCI bus acts as a ceiling on 


its performance. 
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Frequent changing of the encryption keys 
will also lead to a reduction in overall data 
throughput due to the 


encryption 


increased transfer load on the PCI bus. 
Conclusion 


The encryption engine project we've 
described is a great example of how a 
combination of third-party intellectual 
property cores and Virtex-II Platform 
FPGA technology can yield flexible, 
high-performance security products in 


record time. 


A large reduction in the effort required to 
implement such a design relies on the 
ready availability of intellectual property 
like the Xilinx LogiCORE PCI products 
and the range of advanced encryption 
cores we have developed at Helion. Design 
blocks like these enable a single engineer to 
attack a million-gate design and achieve 


results in a matter of weeks. 


Figure I - DMA encryption 


engine block diagram 


Future plans for our encryption engine 
include the use of a higher bandwidth bus 
interface, such as HyperTransport™ tech- 
nology in place of PCI. We also plan the 
addition of multiple AES cores to provide a 
full-duplex (encrypt and decrypt), higher 


performance solution. 


The Helion encryption cores allow for even 
higher throughput: Eight similar Fast AES 
cores in a larger Virtex-II Platform FPGA 
could provide encryption at rates in excess of 
12 Gbps. This is fast enough to support 
emerging high-speed interconnect standards, 
such as OC-192, 10 Gigabit Ethernet, POS- 
PHY L4, and RapidIO™ Phy. 


For more detailed information on Helion and 
our products and services, please visit the 


Helion website at www.heliontech.com. ¥. 
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by Steve Prokosch 
CPLD Product Marketing 
livemlie 

steve. prokosch@xilinx.com 


When Xilinx asked design engineers what 
they expected from the new generation of 
complex programmable logic devices 
(CPLDs), their responses showed they 
pretty much wanted it all: 


e Advantages derived from state-of-the-art 


fabrication processes 
e More industry standard I/O features 
e System security 
¢ Lower power operation 
e Higher performance. 


Designers required the next generation of 
CPLDs to increase performance of bat- 
tery-powered devices. And, they stipulat- 
ed that the next generation of CPLDs 
must improve performance without 
power penalties — and with the features 
and functions that would meet the expec- 
tations of today’s discriminating designers 


and cost-conscious consumers. 


In response to the high expectations of top 
design engineers, Xilinx has created the 
CoolRunner™ II RealDigital CPLD. The 
CoolRunner-II CPLD architecture intro- 
duces, for the first time, the advantage of 
both high performance and low power 
operation. The CoolRunner-II RealDigital 
CPLD reprogrammable logic solution just 
may be the defining element of a new era 
of CPLD-based applications — small form 
factor, high performance, power sensitive 
products that are feature rich and protected 
by multiple levels of security. 
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CoolRunner-Il RealDigital CPLDs 


Evolutions of the XPLA (extended pro- 
grammable logic array) CPLD architecture 
have led to this ultra low power, ultra high 
speed, reprogrammable CMOS logic that 
offers significant improvements over earlier 
generations. (See sidebar “The Evolution of 
Low-Power CPLDs.”) These capabilities 
are aiding in dramatic device size reduc- 
tions and performance increases that are 
enhancing the look and feel of both hand- 
held and high performance products. 


The CoolRunner-II RealDigital CPLD 
delivers on several design fronts: low power, 
high performance, design flexibility, and 
security. Today, designers can choose high 
performance devices that do not require 


extra power for speed. 
Process Technology 


The CoolRunner-II family of CPLDs has 
benefited from the leading edge of Xilinx 
FPGA (field programmable gate array) 
process technology. The original 
CoolRunner CPLD architecture began on 
a 0.5-micron process and then migrated to 


0.35 micron. 


Today, the CoolRunner-I] CPLD family is 
manufactured using a 0.18-micron process 
technology — the same as the high-end 
Xilinx Virtex!™-IT FPGAs. With this ultra 
high density process technology, and scala- 
ble CMOS architecture, it is now possible 
to lower power consumption even further 
than before. 


Multiple I/O Features 


To facilitate its use in multiple system 
architectures, various input/output (I/O) 
standards have been incorporated in the 


CoolRunner-II interface. These include: 

© LVTTL 

* LVCMOS 33, 25, 18 

© SSTL2-1, SSTL3-1 

° HSTL-1 

°1.5V I/O. 

High-speed transceiver logic (HSTL) I/O 


standards are common in high-speed mem- 


ory interfaces. Stub-series terminated logic 


Summer 2002 


Cam ae if 


F.. 


line i 
— 





$° MLK 
Creathirerncucs -fi 


Figure I - Xilinx CoolRunner-II CPLD product family 


(SSTL) is associated with circuit designs 
where buses must be isolated from relative- 
ly large stubs. With these added I/O stan- 
dards, CoolRunner-II CPLDs can easily 
adapt to a variety of interfaces that other 
low power processors might not be able to 
handle. This makes CoolRunner-II CPLDs 
a great system integration solution for 


microprocessor-based applications. 
100% Digital Core 
CoolRunner-II RealDigital CPLDs do not 


use traditional sense amplifier technology — 
and that means the historical tradeoff 
between speed and power has been elimi- 
nated. RealDigital technology has enabled 
Xilinx to offer the industry's lowest power 
consumption and highest performance on 


a single device with no price premium. 
Low Power Enhancements 


Learning that designers of portable prod- 
ucts wanted even lower power than 
CoolRunner XPLA3 CPLDs, Xilinx added 
architectural features to accommodate tight 
power budgets. Even with FZP (Fast Zero 
Power™) technology and voltage reduc- 
tion, we still had to add two new architec- 
tural features to lower overall power con- 
sumption in designs for power sensitive 
applications. Those features are DataGATE 
and CoolCLOCK. 


DataGATE makes it possible to reduce 
power consumption by reducing unneces- 
sary toggling of inputs when they are not in 
use. A prime example of this would be a 
bus interface. These input signals would 
become active only when information is 
passed to the CPLD. This eliminates the 
unnecessary switching from high to 
low/low to high. As bus interfaces grow, so 
will the power demand from needless 


switching of input signals. 


CoolCLOCK, another CoolRunner-II low 
power enhancement, is implemented by 
dividing, then doubling, a clock signal. 
Using Global Clock 2 (GCK2) clock input 
and dividing by 2, then doubling the clock 
at the macrocell, performance can be main- 
tained while lowering power consumption. 
The CoolCLOCK method is easily synthe- 
sized and is available on 128 macrocell and 


larger devices. 


Schmitt Trigger 
When communicating with CPLD design 


engineers, we discovered they were looking 
for an easy method to interface to noisy 
analog components. Thus, CoolRunner-I] 
CPLDs incorporate Schmitt trigger inputs, 
which are helpful by delaying the switching 
time on slow rising or falling signals. 
Schmitt trigger inputs also help eliminate 
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false switching due to noise spikes. The pos- 
sibility also exists to form a simple oscillator 
circuit from these inputs. You must exercise 
care to assure reliability, but it can save 


component costs on simple circuit designs. 
Security 


The loss of intellectual property is always a 
concern for electronic equipment manufac- 
turers. With CoolRunner-II RealDigital 
CPLDs, security was enhanced to eliminate 
the possibility of code theft. The security 
bits are scattered and affect different modes 
of operation. If any tampering of the securi- 
ty chain is detected, the device automatical- 
ly locks down and is erased. Even if the 
device is de-capped, buried interconnects 
make it almost impossible to trace security 
connections without destroying the device. 
Security details cannot be discussed even 
under non-disclosure agreements, thus 


ensuring ultimate protection. 
User-Defined Grounds 


As voltage levels decrease, noise and 
ground bounce become larger factors in 
signal integrity. Now, a few hundred mil- 
livolts is the difference between on and 
off, versus a few volts in legacy devices. 
We had to take extra care in design and 
layout to shorten signal paths and mini- 


mize potential interference. 


Differential signaling (SSTL and HSTL) 
serves as a good method to reduce noise 
and increase signal integrity — with the pro- 
vision that extra pins are available for dif- 


ferential I/O and reference voltage sources. 


To simplify the elimination of noise and 
ground bounce, CoolRunner-II CPLDs also 
employ software-definable ground pins. 
These programmable ground pins are an 
ideal way to reduce noise and ground bounce 
effects. By using neighboring I/Os as ground 
pins, CoolRunner-II devices can tolerate a 


larger amount of potential signal noise. 
Voltage Translator Clearinghouse 


Due to the availability of various specialty 
electronic devices, it may be necessary to 
interface with multiple component voltage 


levels. With CoolRunner-II RealDigital 
CPLDs, I/O banking is an easy way to elim- 
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inate voltage level translators. “Banking” is 
the ability to separate function blocks within 
the CPLD so that these blocks can operate 
independently at different voltage levels. 


For instance, one function block could be 
I/O compatible with 3.3V devices, and 
another function block could interface 
with 1.8V devices. This saves board space 


and eases voltage level design issues. 
Conclusion 


From process technology to a 100% digital 


core to added power reduction features, 


New Product RC 


CoolRunner-II RealDigital devices mark 
the beginning of a new era for CPLD 
applications. Many system design concerns 
— including speed, power, I/O interfaces, 
clock management, and security — have 
been consolidated into one reprogramma- 
ble logic solution: CoolRunner-II 


RealDigital CPLDS from Xilinx. 


For more information, please visit www. 
xilinx.com > Products > CPLDs > Introducing 
CoolRunner-II_ RealDigital CPLDs, or 
www.xilinx.com/xInx/xil_prodcat_product.jsp? 


title=coolrunner2_page. % 


A Short History of Low Power CPLDs 


The first reprogrammable logic devices have their roots in bipolar PROM 
(programmable read-only memory) technology, with added logic capacity and fea- 
tures. The first company to generate customer interest was Signetics, with their 


introduction of the 825100 FPLA (field programmable logic array). 


Monolithic Memories Inc. attempted to be a second source supplier to the 828100, 
but failed due to a process technology mismatch. They instead developed the now 
famous PAL (programmable array logic). This development by MMI led to aggres- 
sive process technology shrinks and increased speeds as logic designers pushed for 


faster state machine operation. 


Philips Semiconductors purchased Signetics, and in 1977, acquired a second source 
license for the PAL from MMI. Philips continued to work on different architec- 
tural enhancements and migrated to a BiCMOS (bipolar CMOS) process. When 
the first 22V10 PAL from AMD hit the market, it was a greater success than any- 
one anticipated. From this beginning, devices matured into what are now known 
as CPLDs. Xilinx entered the CPLD market in the early 1990s by acquiring Plus 
Logic and introduced the 7200 family of CPLDs. 


CMOS CPLD products use power for speed improvements by partially turning on 
(biasing) transistors and by using sense amp technology. In the past, this was an 
easy way to promote devices and claim a speed advantage. The by-product, how- 


ever, was heat. 


In 1994, Philips Semiconductors made the move to CMOS CPLDs. With this 
decision, CPLDs broke away from their B1ICMOS PROM roots and entered into 
the scalable process technology arena. These products reflected innovative 
approaches in circuit design with an awareness of power consumption by removing 


the old bipolar style sense amplifier from the circuitry. 


Xilinx purchased the CoolRunner line of CPLDs from Philips Semiconductors in 
1999 to penetrate the low power CPLD market segment. Today, ultra high density 
CMOS processes (0.18 micron) make possible reduced size and lower power con- 


sumption, which directly reduces heat dissipation. 


Consumers’ demands for smaller, faster, cheaper, better products are the driving 
forces in today’s competitive markets — and the state-of-the-art CoolRunner-II 


RealDigital CPLDs meet these demands. 
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Business Continuity 


it Disaster Strikes, Xilinx Is Prepared 


War, natural disasters, terrorism, economic ruin, and other catastrophes can’t be prevented, but their effects can be 
attenuated. Xilinx has in place a Business Continuity Program designed to minimize the impact on your business. 








by Alicia Tripp 

Manager, Business Continuity Program 
Xilinx, Inc. 

alicia. tripp@xilinx.com 


Our customers depend on us — and you 
expect us to provide reliable product and 
consistent service, even in the event of a 
disaster or interruption to our business. For 
these reasons and for the benefit of our 
stakeholders, we incorporated the Business 
Continuity Program (BCP) into the Xilinx 
culture in June 2000. The BCP is a corpo- 
ration-wide initiative that is supported by 
our Board of Directors and throughout our 


employee population. 


We began the program by conducting a 
business impact analysis (BIA) and a threat 
and risk assessment (TRA) to determine 
the critical business processes within the 
company. In addition, the BIA and TRA 
provided an understanding of the impact 
of the loss of one or more of the critical 
functions. Once the analyses were complet- 
ed, more than 80 departmental recovery 
plans were developed. The recovery plans 
are the road maps that will allow us to con- 
tinue providing products and services to 


our customers during a major disruption. 
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Incident management teams (IMTs) are 
one of the most vital parts of the Business 
Continuity Program. We have located 
IMTs strategically throughout the Xilinx 
enterprise. Each team consists of members 
of functional business units within the 
company such as Human Resources, Legal, 
Facilities, Risk 
Management, Accounting, and Safety. The 


Purchasing, Security, 
IMTs are responsible for handling the 
event, from the time of a disaster through 
recovery and restoration activities. The pur- 
pose of the IMTs is to protect the health 
and safety of our employees, to guide the 
company through a crisis, and to provide a 
structured organization for overseeing the 
response. These teams will convene 
during a major business interrup- 

tion to assess the situation and 

determine if a disaster should 
be declared and whether or 
not to launch departmen- 


tal recovery plans. 


Incident management teams convened 
four times in less than 48 hours after the 
September 11, 2001, terrorist attacks to 
make plans to assist our employees. We 


were able to have counselors on site, and 






Business Continuity 
Program 


travel representatives helped employees 
traveling on business to schedule return 


flights home. 


The Business Continuity Program has been 
successfully integrated into our business 
culture — in large part, because of the top- 
level support from executive management. 
From the very beginning, our executive 
staff has been onboard to make sure the 


program goals and objectives are achieved. 


The BCP includes an executive awareness 
and ongoing plan development that helps 
our executive officers to continue to be 
involved and ready to respond. This execu- 
tive awareness plan will allow Xilinx to prop- 

erly manage any incident from the time 
a disaster is declared through recov- 


ery and restoration activities. 


Recovery plans and IMTs are exer- 
cised on a regular basis. By devel- 
oping and actively maintaining 

the Business Continuity Program, we 

are prepared to limit the effect a long-term 
business interruption might have on Xilinx — 


and to mitigate the impact on our customers. 


Please contact your local sales representa- 


tive for additional information. & 
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-ven as Designs Grow More 
Complex, Xilinx Sottware 
Improves Your Productivity 
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by Lee Hansen 

Product Marketing Manager 
Xilinx, Inc. 
lee.hansen@xilinx.com 


Your Xilinx programmable logic device 
offers you more capabilities than ever 
before, and coming releases of the ISE 
design tools will help you to handle the 
pressure that comes from using these 
complex new features. Come along for a 


glimpse into the future of ISE. 
ISE Delivers a Wealth of Device Features 


The most compelling pressure on the 
logic design flow comes from the added 
complexity of new capabilities and tool 
features. Embedded systems, increased 
clock speeds, verification of high-density 
designs, new device capabilities, high- 
speed I/O, a variety of IP (intellectual 
property) sources, and design reuse — all of 
these logic trends force more requirements 


into the design flow. 
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In 2001, the Virtex™-II FPGA intro- 
duced a revolution in clock capabilities 
with Digital Clock Manager (DCM), 
and this year, the Virtex-IJ Pro™ 
Platform FPGA broke new ground by 
delivering 3.125-gigabit serial I/O trans- 
ceivers and embedded IBM PowerPC™ 
microprocessors. As the demand for 
more device features continues to grow, 
engineers must learn all the program- 
ming attributes for these new features 

— and the learning curve escalates 

with the introduction of each new 


device feature. 


To ease the learning curve, Xilinx 
ISE 5.1i will offer designers even 
more interactive architecture and 
design assistance. In addition to 
quick and easy dialogs, wizards will 
automatically step you through con- 
figuring advanced device features and 
inserting those configurations direct- 
ly into your HDL source code. 
Upcoming releases of ISE will soften 
your learning curve, helping to speed 


design completion. 
ISE Enables IP Capture and Design Reuse 


As design sizes grow, source manage- 

ment and methods to improve source 
code productivity must keep pace. In a 
multi-million-gate design, source code 
can come from multiple resources, 
including purchased IP, developed code, 
and “design reuse” — modules of HDL 


developed and proved in a prior design. 


In upcoming releases of ISE, it will be 
possible to capture proven IP modules at 
the floorplan level. This captured IP will 
not be just a module of code, but it will 
also have attached relative placement and 
floorplanning area information that will 


help speed implementation in later uses. 


As time-to-market pressures increase, 
design change impacts late in the design 
cycle will become an even larger design 
challenge. Enhancements to ISE will 
confine late-cycle design changes to only 
that portion of the design that is 
required to change. The remainder of the 
design will be left intact, thus speeding 
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overall design completion. This technolo- 
gy is enabled by new floorplanning capa- 
bilities along HDL hierarchy boundaries 


that make it easier to define areas of logic. 
High-Speed Design Will Push Innovation 


High-speed design pressures will continue 
to push more innovation into logic design 
and into the board design flow. 
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Timing analysis and timing constraints 
have — until now — lived under two differ- 
ent analysis domains, separated at the 
FPGA pin. The logic designer has looked 
to the specification and timing require- 
ments to define design closure, while the 
board designer has picked up the signal at 
the logic pin and attempted to lay out and 
analyze the effects that the PCB trace will 


have on that signal. 


Now, however, the line between logic 
designer and board designer functions will 
begin to blur as timing constraint lan- 
guages become more uniform across the 
logic tools and among the logic-level and 
board-level tools. Trace analysis packages 
also will increasingly use more complex pin 
models provided by the logic tools, but 
these complex analyses will also become 
much faster, dumping slower general ana- 
log simulation in favor of specialized trans- 


mission line trace simulations. 


A Higher Level of Abstraction Is Coming 
While high-level languages (HLLs) have 


been explored in logic and systems design, 
the pain of existing logic design method- 
ologies hasn't been great enough to force a 
high-level language or associated method- 


ology into common use. 


Verilog and VHDL have served the logic 
design space well for the last two decades, 
but now logic design sizes are routinely 
exceeding one million gates. These mil- 
lion-gate-plus designs require large and 
cumbersome HDL source code and long 
periods of debugging time. Emerging 
embedded processor methodologies are 
also complicating the design process. 
Moreover, the need to serve two distinct 
language-driven design flows (hardware 
and software) is now making HLL design 


support imperative. 


Xilinx is investing time and technology 
in the industry-wide HLL efforts 
through partner technologies such as 
Synopsys® SystemC™ and Celoxica 
Handel-C — as well as through Forge 
Compiler, which Xilinx acquired from 
LavaLogic™ two years ago. The results 
are leading to optimized performance 
for FPGA devices, and to better inte- 
gration of the co-design and co-verifica- 
tion phases of the embedded systems 


design flow. 


Longer-term technology is also being 
defined to help you “architect” the poten- 
tial design at the HLL source code level, 
analyze what modules should be imple- 
mented in the logic design flow, and 
determine what modules are required in 
the processor design flow to reach opti- 


mum system performance. 
Conclusion 


New device capabilities will continue to push 
design complexity. The good news is that 
Xilinx design tools are keeping pace. New 
releases of ISE software will help you keep 
your design time down and your time to 
market fast. For more information on Xilinx 
ISE logic software, go to www.xilinx.com/ise 
— and watch for the release of ISE 5.1i 


coming in late summer. 8% 
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Exploring Hardware /Sottware 
Co-Design with Virtex-ll Pro FPGAs 


To explore the possibilities of hardware /sottware co-design, the Xilinx Design Services team created a reference design 
for a realtime operating system (RTOS) for telecommunications — here is an interview with the team. 


=@) 


by Jean-Louis Brelet, Manager 
APD Technical Market Research 
Xilinx, Inc. 

jean-louis. brelet@xilinx.com 


. - In this interview, Jean-Louis Brelet, 
manager of Technical Market Research 
for the Xilinx Advanced Products 


Division, interviewed Xilinx Design 


Services (XDS) software and hardware 


engineers about their practical experience 
with an RTOS reference design project. 
Senior Project Engineer Matthew Roche 
acted as spokesman for XDS. 





What is hardware/software co-design? 


Hardware/software co-design generally 
means that the hardware and software are 
developed at the same time — some software 
functions may be implemented in hardware 
for additional speed, and some hardware 
functions may be implemented in software 
to free up logic resources. 


What does XDS do? 





Xilinx Design Services, located in Dublin, 
Ireland, is part of the Xilinx Global Services 
Division (GSD). XDS has two functional 
groups: software engineers who provide cus- 
tom software solutions, and hardware engi- 
neers who provide custom hardware solu- 
tions. The combined team consists of more 
than 50 engineers. 


Typically, we help customers with their full 
product life cycle, from requirements cap- 
ture, through design implementation and 
test, to final acceptance and delivery. We 
work very closely with our customers, and 
form a team of experts who best match the 
job requirements. 
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What are your previous experiences 


with embedded software? 


We have worked on technologies that 


include: 


¢ Digital video systems — both set-top boxes 
and head-ends 


¢ High speed digital network communica- 


tions systems 
¢ Automotive applications 
¢ Real time operating systems 
¢ Board-level designs. 


Due to the nature of design services, we have 
attained experience from many different 
development environments and methodolo- 
gies. We have experience with a wide range 
of designs, for device-level to system-level 


applications. 


How would you describe the Virtex-II Pro 
FPGA family from a software perspective? 


The introduction of PowerPC™ processors 
into the Virtex-IJ Pro™ Platfrom FPGA 
architecture creates many new possibilities. 
Now we can design custom co-processors 
with innovative (and fast) interfaces. Due to 
the flexibility of the embedded IBM proces- 
sors, and their surrounding programmable 
logic, we can offload some parts of our soft- 
ware design into a dedicated hardware 


implementation that can run much faster. 


Previous hardware/software designs have 
been based around an inflexible hard- 


ware/software interface. 


When did you start 
to design for the 
next-generation 
Virtex-IT Pro 
Platform FPGA? 


As soon as the tech- 





nology became 


Jean-Louis Brelet 


available, it became 
apparent that we must evaluate the system 
in a real project situation — the RTOS ref- 
erence design. This way we could become 
aware of the new possibilities offered by the 
Virtex-II Pro device — and be best placed to 
contribute our knowledge of the advanced 


architectures used by our customers. 
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Project Objectives 
What are the main objectives of this project? 


The overall and real objective was to gain 
experience in developing a product in a 
tightly coupled hardware/software environ- 
ment. The ability to shift functions from 
software to hardware, or vice versa, in a very 
short time frame, creates an environment 
where the design teams need a much greater 


knowledge of each other’s work. 


The smaller and more restricted objective 
was to identify the speed bottlenecks in the 
RTOS and 
through the use of dedicated hardware — 


remove the bottlenecks 


effectively implementing large sections of 


the kernel in hardware. 





Before Virtex-II Pro FPGAs, how would you 
develop a system like this? 


The RTOS project is very suited to the 
Virtex-II-Pro architecture. Given the 
costs, project time, cycle time, and risk 
required to develop this project with other 
technologies — for example, an ASIC — it 
is unlikely that it would even be under- 
taken. An ASIC design doesn't offer you 
the benefit of reprogrammability and flex- 
ibility to change/improve or upgrade the 
system in the field. 


What is the expertise required to successfully 


implement these systems? 


Software people must understand the nature 
of hardware designs and the type of prob- 
lems encountered by the hardware team. 


They also must understand the possibilities 





and capabilities of the 


hardware. 


Likewise, the hard- 
ware team must have 
a good understand- 
ing of software and 
how the application 


Mathew Roche 


operates. Both teams 
must have a common language — or a good 
understanding of each other’s language 


and a willingness to adapt. 
Partitioning 


What is your approach at the system level to 
partition embedded hardware and software 
functions? 


We designed and implemented the RTOS 
reference design project for telecommunica- 
tions with resource-constrained processors. 
We built it for state-based SDL (specifica- 


tion and design language) applications. 


The application for this RTOS was to be 
in systems with high numbers of processes 
with large numbers of context switches. 
At this level, the overhead of the operating 
system would be significant and thus, 


must be minimized. 


First, we analyzed the system and identified 
the time-hungry elements. Our initial per- 
formance analysis indicated that processing 
of the state tables and managing the most 
heavily used queues (signal queues) were the 
processor-intensive tasks. The first iterations 
of the design placed these tasks into hard- 
ware with a clearly defined hardware/soft- 
ware interface. This did not completely 
remove the bottlenecks, but it did have a 
tendency to move them from the processing 


into the interface. 


Further iterations of the design required 
much closer integration of the hardware 
and software teams. This allowed the 
design of coherent and integrated tasks, 
which could be completely dealt with in 
hardware, and required minimal use of 


the interface. 


Having the processor embedded into the 
FPGA fabric allows a variety of interface 
techniques, which do not necessarily require 
the use of the IBM CoreConnect™ proces- 


Xcell Journal 25 


sor local bus. This feature became an impor- 
tant design issue, as the speed of the system 


was capable of hogging the bus. 


In our experience, we have found that there 
is no distinct division between functions 
that are suitable for hardware or software. It 
becomes a trade-off of how fast you want to 


do something and how many configurable 


At the system design level, the division of 
labor became less important as the project 
went on. It was necessary for both sides to 
have a good understanding of what the 
other team was doing. Design decisions had 
to be made with knowledge of both hard- 
ware and software. This had some interest- 
ing implications for the language and termi- 


nology used by the teams. 


SYSTEM 


Woper Software Layers 


logic blocks you want to spend on doing it. 
Greater complexity of the function or 
required speed of operation will bias the 


design towards software or hardware. 


How would you summarize your method for 


partitioning the system? 


We simply integrated the design teams, to 
avoid a purely hardware- or software-led 


solution. 
How have you split the teamwork? 


Most of the software was designed first, 
because we needed to understand how the 
system would operate before transferring 
parts of it into hardware. This gave us the 
advantage of having functioning proto- 
types before a full hardware implementa- 
tion — significantly reducing our theoreti- 


cal time to market. 
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Virtex-Il Pro Embedded Software 


Is it easier to create embedded software for 
the Virtex-II Pro fabric? 


In the first applications, embedded software 
development techniques will not necessarily 
be different for Virtex-II Pro devices — they 
can be viewed as conventional systems with 
performance improvements and reduced 
chip count. However, as experience builds, it 
will become apparent that traditional design 
methods do not use the full potential of the 
device. System partitioning will become part 
of the architectural design process, and the 
boundary between hardware and software 


will become less rigid. 


The RTOS project gave us an opportunity 
to examine the project with system co- 
design as the goal. Virtex-[I-Pro Platform 
FPGAs allow significantly more integrated 


Perspective [MRVintel arene Pro 


designs with all the performance benefits 
that entails. 


What is the most innovative feature of 
Virtex-II Pro for a software engineer? 


Using multiple processors (both IBM 
PowerPC 405 hard core and Xilinx 
MicroBlaze™ soft core processors), on a 
single chip, allow you to dedicate processes 
to particular processors for critical tasks. 
You can easily implement interfaces 
between the different processors, using 
hardware-based FIFOs, reducing the com- 


munications bus overhead. 


How easy 1s it to partition a 
Virtex-II Pro design? 


Partitions are very flexible in the Virtex 
architecture. Changes were being made to 
the interface all the way through the design 
and implementation of our RTOS system. 
This allowed us to reduce our design time 


and minimize design roadblocks. 
Conclusion 
Could you summarize your experience? 


Close cooperation between hardware and 


software teams is critical. 


Is this project an example of making 
the impossible possible? 


No. This project does not do anything 
that has been impossible before. What it 
does do is make it faster, cheaper, and 
with reduced risk. 


How can Xilinx customers benefit from 


your expertise? 


It is part of our culture to maintain high lev- 
els of expertise in advanced and state-of-the- 
art technological systems and methodolo- 
gies. As project managers, we are constantly 
improving our service and quality in dealing 
with our customers’ projects. Increasing 
competitiveness in the marketplace means 
that there is a higher emphasis on time to 
market and quality. XDS continues to 
improve its skills in order to help our cus- 


tomers achieve these challenging targets. 


XDS can be contacted by e-mail at 
designservices@xilinx.com or by visiting 


http://support.xilinx.com/xds/, & 
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by Michael Worry 
CEO, Nuvation Engineering 
michael. worry@nuvation.com 


Time to market pressures have ruled out 
designing an ASIC from scratch, but 
without a custom ASIC, the discrete 
FPGA, microprocessor, and off-board 
high-speed serial I/Os will never fit last 
years form factor. You just can't fit 16 
channels this year on a board designed for 


four channels last year. 


As a Xilinx XPERTS partner and 
an engineering services compa- 
ny working in the cross disci- 
plines of FPGAs, printed cir- 
cuit boards, and firmware, we 
at Nuvation Engineering often 
encounter these problems of 
form, fit, and function. We 
need an onboard industry stan- 
dard microprocessor, integrated 
high speed I/Os, off-the-shelf 
IP blocks, and double last year’s 


gate count. 


Xilinx has delivered on all this 

and more with their introduction 

of the revolutionary Virtex-I] Pro™ 
Platform FPGA. It starts with the techno- 
logically advanced Virtex™-II infrastruc- 
ture, supporting up to 8 million gates and 
I/O serial speeds of 3.125 Gbps. Nestled 
into this digital workshop is the IBM 
PowerPC™ 405 microprocessor — the 
number one embedded CPU for network- 
ing and telecom infrastructure. The contin- 
ued convergence of programmable logic 
with embedded systems has created a new 
digital workbench with a wealth of integrat- 
ed platform standards and rapid co-devel- 
opment tools. The Virtex-II Pro Platform 
FPGA is in a class by itself in the exploding 


programmable embedded market. 


FPGAs and Microprocessors: 
A Symbiotic Relationship 


Fundamentally, microprocessors and 
FPGAs fulfill different tasks (see Table 1). 
Programmable logic is needed for wire 
speed, high bandwidth, custom digital pro- 
acceleration. 


cessing, and hardware 


However, that speed and customization 
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© Configurable and programmable 
digital logic 


e |nstantiate multiple processing 
units to run tasks in parallel 


e Hardware accelerate high 
speed tasks 


e Scalable 


© Coded in HDL 


comes at a price, both in parts and non- 


recurring engineering (NRE) costs. 


For monitoring and control tasks, it is far 
cheaper and easier to use a microprocessor. 
Furthermore, microprocessors have existed 
longer and have a larger base of engineer- 
ing familiarity. Thus, many product archi- 
tectures naturally marry an FPGA with a 
microprocessor. At Nuvation, our complex 
designs often start with an FPGA and 


Microprocessor 


© Lower parts costs 


e Coded in firmware such as C 





Table 1 - Hardware accelerates high speed tasks 


microprocessor architecture, then we 
determine which system functions are best 
implemented in each. Xilinx takes this to 
the next level by integrating these func- 


tions into a single chip. 


A more traditional design would be a 
microprocessor integrated with dedicated 
hardwired logic in an ASIC. However, time 
to market pressures are demanding plat- 
forms with full product cycles in months 
and integration cycles in hours. Compared 
to the time for ASIC development and 
potential respins, FPGAs present a highly 


appealing alternative. 


Because the FPGA is reprogrammable 
after release, using technologies such as 
the Blue Iguana™ System, a whole new 
market of revenue streams and business 
models is opened. Xilinx has risen to 
these market pressures by systematically 
driving down the cost of customized 
hardware acceleration while producing 
continued innovation in size, speed, and 


integrated features. 


e Fixed processing units accessed by 
predefined commands 


e Faster development time 
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Virtex-ll Pro Features 


The Virtex-II Pro Platform FPGA brings 
together market-leading Xilinx program- 
mable logic with the industry standard 
PowerPC (PPC) microprocessors and high- 
speed I/Os. Truly this is the digital sandbox 


for the 21st century engineer. 
Integrate Off-the-Shelf IP 


The Virtex-II Pro device offers as many as 8 
million system gates. This allows easy and 
rapid SoC (system-on-chip) 
integration of existing IP cores 
without having to painstaking- 
ly code a custom implementa- 
tion of a standard interface to 


save a few gates. 


The Virtex-II Pro FPGA allows 
you to choose up to four PPC 


e Greater knowledge, tools, installation 
base and engineering familiarity 


built-in cores. No longer do 
you have to meticulously calcu- 
late how many clock cycles 
each firmware operation will 
take to determine the processor 
infrastructure. If the first PPC 
becomes overloaded, just 

change the part number and 
add another PPC to the Virtex-II Pro 
FPGA. You don’ have to respin the board 
or, worse yet, you dont have to deal with an 
increase in the form factor rippling to 


mechanical changes. 


The off-the-shelf features of Virtex-II Pro 
devices are matched with on-chip memory 
of up to 4.5 MB. With this onboard mem- 
ory, you can get high-speed, on-chip bus 
connects without being bottlenecked by a 


memory arbiter to off-chip memory. 
Flexible Clocking Infrastructure 


Any first year student learns the first rule 
of digital synchronous logic: never gate the 
clock. Well, Xilinx has packed that up with 
the vacuum tubes and created an architec- 
ture that allows you to switch — glitch-free 
— among as many as 16 global clocks. This 
is a huge benefit for sharing system 
resources among different clock domains. 
The Xilinx Digital Clock Manager also 
provides on-chip frequency and phase con- 
trol, meaning that an off-chip, phase- 


locked loop is no longer needed. 
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XCITE On-Chip Impedance Control 
After drawing a thousand-pin FPGA, the 


last thing a design engineer wants to deal 
with is adding a few hundred termination 
resistors. Xilinx Controlled Impedance 
TEchnology (XCITE) eliminates those 
resistors with dynamic, on-chip, digitally 
controlled impedance. The savings in 
board space, schematic time, and layout 


time are significant. 
Integrated High Speed I/O 


It seemed not too long ago 
that it was cutting-edge to 
support 64-bit, 66 MHz 
PClIs. Now we are up to 
PCI-X speeds at 133 
MHz. We have support 
for RapidlO™, POS- 
PHY4 and SPI-4_ inter- 
connect technologies, to 
name a few. Pick almost 
any kind of memory you. 
like — ZBT™ SRAM, 
DDR SDRAM, or QDR 
SDRAM, for instance — 
and they can be imple- | 
mented in the Virtex-II 

Pro logic fabric. Then we 

serial I/O 

space, and the Virtex-II 


jump into 


Pro platform can accom- 
modate as many as 16 
Rocket I/O™ multi-gigabit transceivers 
at a whopping 3.125 Gbps. This means 
applications like OC-48 can integrate 
SERDES (serializer/deserializers) right 
into the FPGA — once again, saving board 


space, cost, and valuable time. 
Integrating Customer Hard IP 


Hard IP is when digital logic is implement- 
ed in hard coded gates, removing the pro- 
gramming capability, but also using much 
less silicon. Once a design or IP block is 
stable, it can be converted to hard IP. The 
PPC 405 microprocessor is currently 
implemented in the Virtex-II Pro device as 
hard IP, but Xilinx actually allows cus- 
tomers of sufficient volume to embed their 
own hard IP. For example, if you wanted to 
roll out an existing ASIC to provide addi- 


tional capacity and integrate the micro- 
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processor, the Virtex-II Pro could embed 
that ASIC as hard IP and provide program- 


mable gates for new functionality. 
Integrated Bitstream Security 


The Virtex-II Pro device has on-chip stor- 
age capacity for a 3DES key, either with a 
battery or constant power supply. The 
stored or delivered bitstream is encrypted, 


and can only be unlocked with the on- 


chip key. The on-chip key allows Virtex-II 
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Pro devices to be designed into military 


applications that traditionally have 
required OTP (one time programmable) 
solutions to remove any chance of code 


becoming compromised. 
Reconfigurability 
The Virtex-II Pro Platform FPGA can be 


made even more powerful by enabling 
remote configuration capabilities. Xilinx 
IRL™ (Internet reconfigurable logic) tech- 
nology includes a partnership with Wind 
River Systems Inc. to produce the PAVE 
Framework — PLD API for VxWorks™ 
Embedded System Integration Framework. 
The framework enables Xilinx FPGAs 
using Wind Rivers VxWorks real-time 
operating systems and Tornado™ integrat- 


ed development environments to be recon- 


company 
= a 


figured during development and after 
deployment in the field. 


Additionally, Xilinx is in partnership with 
Blue Iguana Networks Inc., a semicon- 
ductor company that provides secure, 
remote, hardware management. Their 
patented technology enables secure soft- 
ware and hardware upgrades to in-service 
equipment over the Internet and without 
downtime. The Virtex-II Pro Platform 

FPGA is an excellent match 

for Blue Iguana’s technology, 

as the embedded micro- 
processor could upgrade the 


FPGA, or vice versa. 
Conclusion 


Because microprocessor and 
FPGA functionalities differ, as 
applications become more 
complex and diverse, it is 
often insufficient to utilize 
just one or the other. The 
*. development of the Xilinx 
~= Virtex-II Pro Platform FPGA 
resulted from the natural inte- 
gration of FPGAs and micro- 
processors. In the past, the 
only solution would have been 
to utilize multiple, discrete 
ASICs. However, because of 
the time to market require- 
ments — in addition to the 
potential expenses associated with an error 
— reconfigurable processing has become an 


ideal alternative. 


With digital design engineers facing the 
pressures of increased flexibility, shorter 
time to market, and higher clock speeds, 
they will continue to drive the demand for 
integrated standard platforms. The Virtex- 
II Pro Platform FPGA does a masterful job 
of combining a highly technologically 
advanced FPGA architecture with the 
industry standard IBM PowerPC micro- 
processor. As a member of the Xilinx Early 
Adopters Program, as well as a Xilinx 
XPERTS partner, we at Nuvation look for- 
ward to implementing this breakthrough 
technology for the benefits of our clients. 
For more information, please visit us at 


www.nuvation.com. %& 
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ele Robert Bielby, Senior Director 
Strategic Solutions Marketing 
Xilinx, Inc. 
robert. bielby@xilinx.com 


With the recently introduced Virtex-II 


Pro™ product family, the much awaited 


and highly debated immersion of a micro- 
processor into a programmable logic fabric 
= cal has finally occurred. With up to four IBM® 
PowerPC™ 405 microprocessors in the 


highest density family members, Virtex-II] 
Pro Platform FPGAs deliver more than 
2,000 Dhrystone MIPS, a number that is 


6 
unchallenged in the industry. 
Complemented by the _ integration 
of multi-gigabit serial I/O signaling tech- 


nology capable of supporting more than 3 


For anyone who has been tollowing the history of Gbps per I/O pin pais, the Vireex-II Pro 


ey : family delivers the industry’s first trul 
programmable logic, it’s pretty clear that the industry high MEME Platform FPGAs. 


has seen-moreschanges in the course of the past year 5 These advanced programmable logic 
than it has perhaps in its entire 19 years of its existence. 


devices are unparalleled in their capabili- 
ties not only to address a wide range of 
the most demanding applications — 
but also to adapt and adopt emerging 
I/O standards and protocols as they 


become available. 


Advances in semiconductor and program- 
mable logic fabrication have not only 
made families like the high-end Virtex-II 
Pro Platform FPGAs possible, but on the 


other end of the spectrum, technology 


advances have also enabled the recent 
delivery of Spartan™-IIE FPGAs and 
CoolRunner™-II_ RealDigital CPLDs. 
The Spartan-IIE family sets a new stan- 
dard for low cost within the programmable 
logic industry, and the CoolRunner-II 
series sets an equivalent benchmark in the 


industry for ultra low power consumption. 


This premiere portfolio of programmable 
logic solutions gives Xilinx the power to 
address a wide range of new markets and 
applications, including consumer electronics 
and handheld devices. In this issue of Xcell 
Journal, we discuss some of the markets 


we ve opened and applications we've enabled. 
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Metropolitan Area Networks 


The hottest areas in networking today are 
metropolitan area networks (MANs) and 
“edge access” markets. Driven by the explo- 
sive growth of data traffic in metropolitan 
regions, this networking market segment is 
expanding at a 34% CAGR (compound 


annual growth rate). 


New products and technologies based upon 
standards proposals that leverage the com- 
bined benefits of Ethernet and SONET are 
now being rushed to market to address this 
rapidly growing demand for data services. 
However, the uncertainty surrounding the 
overall acceptance of these new standards — 
and the rigorous requirements of supporting 
line rates in excess of 10 Gbps — have given 
rise to the need for solutions like Virtex-II 
Pro Platform FPGAs that offer both flexibil- 
ity and high performance. 


In addition to articles discussing the chal- 
lenges of developing solutions for the 
MAN, we are also pleased to present — in 
conjunction with Avnet Design Services, 
Avnet Cilicon, and Reed Electronics Group 
— an industry-wide event: the Metro- 
Optical Networking Forum to be held July 
25 at the Santa Clara Convention Center in 
California. This free one-day event (which 
will be simulcast to more than 40 North 
American venues) is focused entirely on the 
latest developments in metropolitan and 
edge access networking. For more informa- 
tion, see the article and ad in this issue of 
Xcell Journal. To register for this event, visit 


www.xilinx.com/metro. 
Consumer Electronics 


Consumer electronics continue to grow at a 
very healthy pace, fueled by the trend 
towards digitization. Consumers perceive 
that anything digital is better. This con- 
sumer perception started with the introduc- 
tion of the digital clock. The digital clock 
was more accurate than the analog clock 


and over time, more desirable. 


Miniaturization drove the development of the 
digital watch, handheld calculator, and a myr- 
iad of other consumer products based upon 
digital technologies. Naturally, a plethora of 


new standards and protocols have emerged. 
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Multiple standards, changing protocols, 
high-performance processing, low power 
requirements at cost-conscious consumer 
prices — these are the challenges where the 
low cost Spartan™ series of FPGAs and 


the ultra low power CoolRunner™ 


CPLDs shine. 
Digital TV 


With a heightened awareness and demon- 
strated demand for digitized electronics, the 
television industry logically went about 


defining a new television format based 


entirely upon digital technology. The results 
of high-definition digital TV (DTV) were 
astounding, but the price tag was prohibi- 


tively expensive, and consequently, con- 
sumer interest was low. Until just recently, it 
was hard to pinpoint exactly why consumer 
interest in DTV languished — was it because 
of price, or did consumers just think that 


digital video was not interesting? 


Actually, it has only been in the past five 
years that the story has slowly unraveled. 
Consumers have, in fact, voted strongly in 
favor of digital video — but this has come, 
not in the form of an end-to-end digital 
system, but rather in the form of a single 
video component: the digital video disc, or 
DVD. Never, in the history of any con- 
sumer electronics product, has there been 
such a wide and rapid acceptance of any 
new medium or format as there has been 
with the introduction of the DVD. The 


DVD represents one of the most successful 


consumer electronics products in history. 
According to International Data Corp, 
DVD had penetrated 23% of US house- 
holds by end of 2001. 


The success of the DVD is attributed to sev- 
eral factors: the significant improvement in 
image quality, the industry-wide standardiza- 
tion that reduced consumer confusion, and 
high-volume sales, which led to accelerated 
cost reductions. Another key factor in the 
widespread acceptance of DVDs was that they 


can be viewed on existing analog televisions. 





Now, as digital video has grown in popular- 
ity, it is spawning a host of new digital prod- 
ucts — and formats — including plasma dis- 
plays, personal video recorders, read/write 


DVDs, and set-top boxes. 
DTV has also spawned a widespread con- 


cern in the “infotainment” industry over 
video piracy. This concern is giving rise to a 
suite of new encryption standards, including 
5C, 4C, CPTWG, and TCPA (to mention a 
few) that are being proposed by a wide range 
of interested parties, including the motion 


picture industry. 
Conclusion 


In this issue of Xce/, we discuss some of the 
exciting Xilinx solutions for consumer elec- 
tronics, with a special emphasis on digital 
video applications, such as video image pro- 
cessing, navigating changing standards, and 
addressing encryption challenges. We hope 
you enjoy these articles. %& 
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FPGAs — Enabling DSP in 
ReatTime Video Processing 


With Xilinx FPGAs you can create DSPs that are much 
faster than any off-the-shelf stand-alone DSP device. 
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by David Nicklin, Senior Manager 
Strategic Solutions, Wireless 

Xilinx, Inc. 

dave.nicklin@xilinx.com 


Real-time video processing makes enor- 
mous demands on system-level perform- 
ance requirements and, except for the sim- 
plest of functions, is well beyond the capa- 
bilities of any general purpose DSP 
device. Therefore, DSP-based video 
designs often require several DSP devices 
to get the necessary throughput, along 
with the overhead of multiple program 
and data memory resources. However, 
designing with programmable logic allows 
you to implement video signal processing 
algorithms using parallel processing tech- 
niques, giving you the performance you 
need within a single, highly flexible pro- 
grammable logic device. By performing 
video processing functions in real time, 
you can minimize the need for frame 


stores and data buffering. 
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With the industry pushing for higher levels 
of video quality, and the development of 
improved compression schemes, system 
processing rates have increased dramatically. 
At the same time, we have introduced new 
programmable logic devices, such as the 
Xilinx Spartan™-ITE FPGAs, which draw 
on the architectural heritage of the FPGA 
devices that are commonly used in profes- 
sional broadcast equip- 
ment today. As FPGA 
process development fol- 
lows Moore's Law, our 
new products can perform 
the same functionality as 
their predecessors, with 
even higher performance 


and a much lower cost. 


Preprocessor 


By developing systems 
using our latest cost-effec- 


tive FPGAs, 


bring unprecedented lev- 


you. can 


els of professional, broad- 
cast quality, video pro- 
cessing into areas of digi- 
tal video technology such 
as high-end consumer 
products, security sys- 
tems, industrial and 
machine vision applica- 


tions, and frame-grabbers. 


One driver of this trend, is 
the combination of net- 
working, broadcast, pro- 
cessing and display technology, in what the 
industry has termed “digital convergence.” 
The need to send high bandwidth video 
data over extremely difficult transmission 
channels, such as wireless, while still main- 
taining an acceptable quality-of-service 
(QoS), is extremely difficult. This has lead 
to wide ranging research in how to improve 
error correction, compression and image 


processing technology, much of it based on 


advanced FPGA technology. 


Image Compression /Decompression — 
DCT/IDCT 


The main video compression scheme 


used in digital video systems today is 


MPEG2. It can be found at the heart of 
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fH !mplementation undefined 
in standard 


[1] Defined in the standard 


digital television, set-top boxes, digital 
satellite systems, high-definition televi- 
sion (HDTV) decoders, DVD players, 
video conferencing equipment, and Web 
phones, to name just a few. Raw digital 
video information invariably has to be 
compressed so it can be either sent over a 
suitable transmission channel, or stored 


onto a suitable medium such as a disc. 
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Real-Time Video Processing 


blocks that are left undefined. It is within 
these undefined sections of the standard 
that a company can truly differentiate its 
product from its competition and develop 
its own proprietary algorithms. Many pro- 
fessional MPEG encoders use FPGAs in 
these sections, such as the motion estima- 
tion block, as shown in Figure 1. Because 


FPGAs are reconfigurable, the equipment 
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Figure 1 - Simplified video encoder diagram 


There are also a number of emerging stan- 
dards, including most notably MPEG4. 
Most products based around this technolo- 
gy are still in development, although ramp- 
up to production is expected shortly. At the 
heart of the MPEG2 and MPEG4 algo- 
rithms is a function called the discrete 
cosine transform (DCT). The aim of the 
DCT is to take a square of pixel blocks and 
remove redundant information that is 
imperceptible to the viewer. To decompress 
the data, the inverse discrete cosine trans- 


form (IDCT) function is required. 
While the DCT section of the MPEG 


algorithms is standardized and can be 


implemented very efficiently within 


FPGAs, MPEG encoding has a lot of 


* Search Patterns 
* Evaluate Modes/Options 





can be easily updated to incorporate new 
algorithms at all stages of development, 
including in the field after deployment. 
Companies that rely totally on standard 
ASSP solutions are limited in their ability 
to produce products that can make them 
stand out from the competition, and they 
therefore run the risk of being seen as just 
one of a number of similarly specified 


solutions in the market. 
Color Space Conversion 


Another important part of a video system 
is the requirement for color space conver- 
sion, a process that defines how an image 
specified in one color format can be con- 


verted into a different color format. 
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Figure 2 - Color space cube 


Receptors in the human eye are only capable 
of detecting light wavelengths from 400 nm 
to 700 nm. These receptors are called cones 
and there are three different types, one for 
red light, one for green, and one for blue. If 
a single wavelength of light is observed, the 
relative responses of these three sensors 
allow us to discern what we call the color of 
the light. This phenomenon is extremely 
useful because it means we can generate a 
range of colors by simply adding together 
various proportions of light from just three 
different wavelengths. The process is known 
as additive color matching, and is used in 


color television systems. 


It’s possible to represent colors of light by 
plotting the red, green and blue (RGB) 
components proportions on a 3-dimen- 
sional cube, with black at the origin and 
while at the diagonally opposite corner. 
The resulting cube is known as the “RGB 


color space,” as shown in Figure 2. 


Whether the final display medium is paper, 
LED, CRI, or plasma displays, the image is 
always broken down into an array of pic- 
ture elements or pixels (HDTV, for exam- 
ple, can have 1920 x 1080 pixels). While 
the mechanics change slightly for each 


medium, the basic concept is that each 
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pixel displays a proportion of red, green, or 
blue depending on the voltage signals driv- 
en to the display. 


Processing an image in RGB format, where 
each pixel is defined by three 8-bit or 10- 
bit words corresponding to each primary 
color, is certainly not the most efficient 
method. With such a format, every action 
on a pixel has to be performed on all the 
red, green, and blue channels. This invari- 
ably requires more storage and data band- 


width than other alternative color formats. 


To address such issues many broadcast 
standards, such as the European PAL and 
North American NTSC television sys- 
tems, use luminance and color difference 
video signals. A requirement therefore 
exists for a mechanism to convert between 
the different formats, and this is called 


“color space conversion.” 


Realizing these circuits in hardware is a rel- 
atively simple task, once the coefficients to 
map from one plane to another are known. 
One common format conversion is RGB to 
YCbCr (and conversely YCbCr to RGB). 
See Figure 3. It has been found that 
between 60% and 70% of the luminance 
information (Y) a human eye can detect 


comes from the green color. The red and 
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blue channels in effect duplicate much of 
the luminance information, and hence, this 
duplicate information can be safely 
removed. The end result is that the image 
can be represented as signals representing 
chrominance and luminance. In this for- 
mat, the luminance is defined as having a 
range of 16 to 235 in an 8-bit system, and 
the Cb and Cr signals have a range of 16 to 
240, with 128 being equal to 0. 


A color in the YCrCb space is converted to 
the RGB color space using the equations: 


R’ = 1.164 (Y — 16) + 1.596 (Gr — 128) 


G’ = 1.164 (Y — 16) — 0.813 
(Cr — 128) — 0.392 (Ch — 128) 


BY = 1.164 (Y — 16) + 1.596 (Cr — 128) 


R’G’B’ are gamma corrected RGB values. 
A CRT display has a non-linear relation- 
ship between signal amplitude and output 
intensity. By gamma correcting signals 
before the display, the relationship between 
received signal amplitude and output 
intensity can be made linear. The output 
gain is also limited below certain thresholds 
to reduce transmission-induced noise in 


the darker parts of an image. 


There are a number of possible implemen- 
tations — memory, logic, or embedded 
multipliers — to perform the necessary mul- 
tiplication functions. It is certainly possible 
to meet and exceed the 74.25 MHz data 
rate required for HDTV systems. It is also 
possible to try different design trade-offs, 
such as that between system accuracy and 
design area. For example, for a 3% error in 
luminance, the design size of a YCrCb-to- 
RGB color-space-converter can be more 
than halved. This may be unacceptable in 
most display products but could be accept- 
able in other applications such as machine 
vision or security systems. By using an 
FPGA, you can tailor the algorithm for the 
application, thereby maximizing perform- 


ance, efficiency, or both. 


Real-Time Image and Video 
Processing Functions 


Limitations in the performance obtainable 
with standard DSPs has led to the develop- 


ment of specially designed chips, such as 
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media processors. However, these devices 
have often proved to be too inflexible in all 
but a narrow range of applications, and can 
suffer from performance bottlenecks. The 
limitations of a processor-based approach 
become especially apparent in high-resolu- 
tion systems, such as HDTV and medical 
imaging. Fundamentally, a processor solu- 
tion is restricted in how many cycles can be 
allocated to each tap of a filter, or each stage 
of a transform. Once the performance lim- 
its have been reached there is often no other 
way around the problem than to add extra 


DSP parts. 


An FPGA, however, can be custom tailored 
to provide the maximum efficiency of utiliza- 
tion and performance. It’s possible for you to 
trade off area against speed, and invariably 
perform a given function at a much lower 
clock rate than a DSP would require. 


For example, Visicom Inc. found that for 
a median filter application a DSP proces- 
sor would require 67 cycles to perform the 
algorithm. An FPGA needed to run at 
only 25 MHz, because it could realize the 
function in parallel. For the DSP to 
match the same performance, it would 
have to run at over 1.5 GHz. In this par- 
ticular application, the FPGA solution is 
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some 17 times more powerful than a 


100 MHz DSP processor. 


There are a wide number of real-time image 
and video processing functions that are well 
fitted for implementation in FPGA devices 


— these include real-time functions such as: 
¢ Image rotation 

° Scaling 

¢ Color and hue correction 

e Shadow enhancement 

e Edge detection 

¢ Histogram functions 

¢ Sharpening 

¢ Median filters 

¢ Blob analysis. 


Many of these functions are both applica- 
tion specific and system specific, and are 
based around core structures such as 2D- 
FIR filters. Such functions can be imple- 
mented quickly using HDL language 
design or by exploiting the DSP building 
blocks found in high-level core design tools 
such as the Xilinx CORE Generator™ 
software. It’s also possible to reduce both 


design and simulation time by employing a 


1.164 


x 





system-level design approach, using prod- 
ucts such as The MathWorks MATLAB™ 
and Simulink™ software, as well as Xilinx 


System Generator. 
Conclusion 


With digital convergence, you need to 
integrate various standards and require- 
ments into one homogeneous product — 
this demands flexibility in design and 


implementation. 


FPGA technology is addressing the system 
requirements of new and emerging video 
applications, bringing with it the features 
and signal processing performance that have 
made it the preferred solution for video and 
image processing in the professional broad- 
cast equipment market for a number of 
years. The latest generation of FPGAs can 
provide the same level of performance and 
functionality, at a fraction of the cost of the 
FPGAs that were designed into professional 


equipment just a few years before. 


In comparison to ASSP and chipset solu- 
tions, FPGAs offer levels of flexibility 
that designers demand in today’s prod- 
ucts, while at the same time maintaining 
a distinct performance advantage over 


conventional DSPs. & 
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Figure 3 - YCrCb-to-RGB color space converter 
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Perspective Dyfelicol mAs 


° by Robert Green, Manager 
Strategic Solutions, Digital Audio-Video 
Xilinx, Inc. 
robert.green@xilinx.com 


Broadcasters are very interested in creat- 


ing new revenue from digital television 


e oF e ® services. However, a vast range of evolving 
digital television (DTV) standards in 
transmission, image processing, intercon- 
nectivity, and display interfaces can create 


considerable barriers to speedy implemen- 
tation. Yet, the industry promotes digital 


@ ® broadcasting, highlighting better video 
and sound quality to attract new sub- 
scribers from the analog world. Consumer 
adoption of digital TV is important if 
broadcasters are to adhere to the schedule 

| . — i | | | - V4 

4 : ; | 


of analog switch-off. 


” 


Bandwidth is precious; to make the most of 


it, compression schemes have steadily 

improved and new algorithms are in devel- 

opment to push the envelope even further. 
Be Be 






As such, system-processing rates have 





increased over time, and real-time image 


processing is now an ideal way of meeting 





these requirements while removing memo- 





ry overhead. At the same time, Moore's Law 
has resulted in low cost programmable logic 
devices, such as the new Spartan™-IIE 
FPGAs, that provide the same functionality 


and performance previously only found in 


expensive professional broadcast products. 


Which Standard Do | Use? 


The transition from analog to digital tech- 


nology is providing opportunities for 
broadcasters to offer new services, and 
competition is fierce between OEMs to 
produce cost-effective systems that are 
attractive to consumers. As with many 
technology shifts, this introduces many 
new proposed standards as companies vie 
for market leadership. It’s possible that the 
first company to market becomes a de facto 
standard regardless of the efforts of stan- 
dards bodies to ensure interoperability and 
fair competition. Even when standards 
bodies successfully produce internationally 
recognized specifications, there are often 
many versions of a standard that can be 
adhered to as they try to address many 
member companies’ needs. Add to this the 
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Respective PETRI 


inevitable revisions to standards during 
development, and it is easy to see why the 
flexibility of a fully reprogrammable solu- 


tion is so compelling. 


For example, your return on investment 
would be much greater if you were able to 
reprogram your system so that it complies 
with the latest revision of an emerging for- 
ward error correction algorithm. Replacing 
an aging encryption scheme such as DES 
with a new, more robust version (AES) is 


another example. 


Table 1 shows a summary of broadcast for- 
mats defined by the Advanced Television 
Standards Committee, and is referred to in 
the broadcast industry as “Table 3” (from 
the ATSC A53 specification). There are 36 
options that equipment manufacturers 
need to support in this table alone. There 
are also regional variations to broadcast 
standards, with DVB, ISDB, and the pro- 
posed DMB used in different parts of the 
world. Although ASSPs 
Specific Standard Products) are available, it 


(Application 


frequently means that different chips are 


required for each format. 


An FPGA solution, easily supporting data 
rates exceeding those required by HDTYV, 
would enable all the formats to be support- 


Definition Lines /Frame 


Table 1 - ATSC broadcast formats 


ed with one device that is reprogrammed 
depending on the equipment or operational 
mode it is used in. Such flexibility in a stan- 
dard, off-the-shelf component reduces your 
bill of materials and removes any risk of 


supply problems from an ASSP provider. 


Pixels and Performance 


With the advent of high-definition broad- 
casts in many parts of the world, video sig- 


nal processing requirements have increased 
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Pixels /Line 


dramatically. For example, an HDTV dis- 
play of 1920 x 1080 resolution, 24-bit pix- 
els, and 30 progressive frames per second, 
requires a total uncompressed bandwidth 
of around 1.5 Gbps. Even in areas where 
high-definition images aren't actually 
broadcast, HD pictures are still used in all 


post production stages . 


The latest low cost programmable logic 
devices (the Spartan series) now have mul- 
tiple I/Os with LVDS (low voltage differ- 
ential signaling) that support these sorts 
of data rates. Therefore, even uncom- 
pressed video data can be brought on- and 
off-chip and processed in real time. 
Performing real-time video processing, 
even at HDTV rates, allows you to reduce 


external memory requirements. 


Multiple frame stores and data buffers are 
often used in current digital TV systems 
because the video signal processor portion 
of the design becomes a bottleneck. Using 
the parallel signal processing capabilities of 
FPGAs means that smaller, or even single, 
frame stores can be used, and data buffers 
can be eliminated. The limitations of stan- 
dard DSPs for real-time image processing 
has led to the development of specially 
designed devices. Although performance is 


Aspect Ratios 





greatly improved, these devices have often 
proved to be too inflexible in all but a nar- 
row range of applications and can still suf- 


fer from performance bottlenecks. 
FPGAs in Consumer Digital TVs 
FPGAs have historically only been found 


in expensive professional broadcast sys- 
tems. However, due to low costs, Spartan 
series FPGAs can be used in high volume 
consumer products such as digital TVs 
where set-top box functionality is fully 


Frame Rates 
23.976p, 24p, 29.97p, 29.97i, 30p, 30i 
23.976p, 24p, 29.97p, 30p, 59.94p, 60p 
23.976p, 24p, 29.97p, 29.971, 30p, 30i, 59.94p, 60p 
23.976p, 24p, 29.97p, 29.971, 30p, 30i, 59.94p, 60p 


integrated, allowing it to receive digital 
broadcasts. FPGAs can have many areas of 
responsibility within digital TV sets, as 
shown in Figure 1. Interfacing between 
standard chipsets, as “glue logic,” has 
always been a strong application of FPGAs, 
but many more image processing tasks 
(such as color space conversion) and sup- 
port for network interfaces (such as IEEE 
1394) are now possible in low cost pro- 


grammable devices. 


One driver of the trend to offload image- 
processing tasks to FPGAs is the result of 
the industry-coined phrase “digital conver- 
gence.” This arises from the need to send 
high bandwidth video data over new and 
extremely difficult transmission channels, 
such as wireless networks, while still main- 
taining an acceptable quality-of-service 
(QoS). This has lead to wide ranging 
research in how to improve error-correc- 
tion, compression and image processing 
technology, much of it based around 


implementation in an FPGA. 
Product Differentiation 


By using FPGAs, it is possible for you to 
differentiate your standard-compliant sys- 
tems from your competitor’s products. 
With the MPEG-2 compression scheme, 
for example, it’s 
possible to offload 
the IDCT (inverse 
discrete cosine 
transform) portion 
of the algorithm 
from an MPEG 
processor, to an 
FPGA, to increase 
the bandwidth. 
IDCT (and DCT 
at the encoder) can be implemented 
extremely efficiently using FPGAs, and 
optimized IP cores are readily available to 


include in MPEG based designs. 


By integrating proprietary techniques for 
other image processing functions alongside 
defined blocks such as DCT, it’s possible to 
produce a low cost, single chip solution 
that increases processing bandwidth and 
gives higher quality images than your com- 
petitor. By avoiding systems that rely just 
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on standard ASSP solutions, you no longer 
need to be perceived as providing just 
another product in a number of similarly 
specified products in the market. Plus, with 
FPGAs you can easily modify your design 
at any time to meet new standards or fix 
bugs — even after your product is in your 


customers hands. 
Time to Market and Time in Market 


As well as enhancing DTV systems and 
increasing processing performance, FPGAs 
also get your products to market quicker 
and keep them generating more revenue 
once they are in the field. Xilinx FPGAs 
are based on SRAM technology, which 
allows you to easily reprogram the device 
during the development phase. This allows 
simple debugging of your system, but also 
enables last minute changes to be made to 


the product if needed. 







RF in 





Home 
Network 


Satellite 


Because FPGAs are reprogrammable, 
there’s no waiting for development of a 
new ASSP chipset to support the latest 
revision, or a costly and time consuming 
ASIC re-spin if chip development is done 
in-house. New ways of programming 
devices in the field are giving the added 
benefit of being able to transmit hardware 
updates via the Internet or some other 
communications channel. For example, 
Xilinx IRL™ (Internet Reconfigurable 
Logic) technology allows you to auto- 
matically reconfigure your product, 
remotely, at any time. The power of this 
feature is highlighted in situations where 
first-to-market is key, such as in con- 
sumer set-top boxes. It means that boxes 
can be shipped to customers without 
necessarily having a complete design — 


features can be added later. 
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Figure 1: System diagram of high-end digital TV set 


38 Xcell Journal 


HLOM Oe) alice) 


Perspective MP 


Conclusion 


FPGAs provide both professional and con- 
sumer digital broadcast OEMs with real-time 
image processing capabilities that address the 
system requirements of new and emerging 
video applications. Compared to other tech- 
nologies, FPGAs offer an unrivalled flexibili- 
ty that enables you to get your products to 
market quickly. Remote field upgradeability 
means that systems can be shipped now and 


features, upgrades, or design fixes added later. 


FPGAs also have significant performance 
advantages over conventional digital sig- 
nal processors and have the added benefit 
of allowing you to differentiate your 
products from the many solutions on the 
market. Programmable logic is the ideal 
balance between the flexibility of DSPs 
and the performance of ASSPs for digital 
TV development. & 
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Comprehensive Security Design 


by Karen Parnell 


Product Marketing Manager, Automotive 


Xilinx Iechnologyl 


Researchers at the University of California 


. at San Diego have found a way to “blow 
(] { We Q { up” silicon chips with an electrical signal, 
according to a recent article, “Exploding 


Chips Could Foil Laptop Thieves,” in 


NewScientist.com. 
QC () { QS el The practical application of self-destroying 
chips would be the ability to remotely 


disable stolen laptops, cell phones, 


PDAs, and other devices using wireless 


communication and containing confiden- 
tial information. 


According to the NewScientist.com article 
by Duncan Graham-Rowe, when a device 
is stolen, the victim or service provider 
could call the device and transmit a self- 
destruct order to “detonate” a small quanti- 


ty of gadolinium nitrate. The resulting 


Mobile phone theft is on the rise, but cell phones sqlodnn ould canatiee il cic 
equipped with CoolRunner-lI CPLDs, Internet rendering the stolen device useless — and 


irreparable. Unfortunately, information in 


Recontigurable Logic, and Comprehensive Design i ee ee el 
Security could remove-tne incentive fo steal; and that data could sometimes be more 


useful than the device itself. 





(0) 





At Xilinx, we have a better idea. With 
IRL™ (Internet Reconfigurable Logic) 
technology, Comprehensive Design 
Security, and CoolRunner™-II CPLDs, 
we can remotely disable up to four differ- 
ent functions of a stolen device, rendering 
it inoperable, and with locked-down, tam- 


perproof data. The beauty of the Xilinx 
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solution is the unit is not damaged in any 
way — full functionality and data can be 


easily restored if the device is recovered. 
Whose Problem Is It? 


Every three minutes in the UK, a mobile 
phone is stolen, sometimes at knifepoint. 
This phenomenon is not just confined to 
the UK. It is a global issue and continu- 
ing to rise. For example, in New South 
Wales, in Australia, nearly 40,000 phones 
were stolen between October 1999 and 
September 2000, a 100% rise over the 


previous year. 


But what if the phone hardware could be 
disabled and effectively rendered useless, 
then surely the need to steal this much-cov- 
eted possession would decline. Wouldn't it? 


Not necessarily ... 


When a mobile phone is stolen, it is rela- 
tively easy for the phone operator to disable 
the Subscriber Identity Module (SIM) card 
inside the phone so that calls can not be 
made using that particular number. This is 
of great benefit to the bill player, but this is 
not usually why the phone has been stolen. 
Hackers can install a modified SIM card 
that fools the phone into believing it is a 
prepaid, “pay-as-you-go” phone with 
unlimited credits. This allows the phone to 
be used indefinitely, free of charge. 


What Are Today’s Solutions? 


In March 2001, the then-UK Home 
Secretary Jack Straw said that he would 
meet with the world’s mobile phone manu- 
facturers and police to try to stop the grow- 
ing threat of street robberies that target 
mobile phones. Among the measures con- 
sidered were the need for general vigilance, 
discrete usage, knowledge of security codes 
to lock the phone, and the use of an 


International Mobile Equipment Identifier 
(IMEI) number. 


In addition to the SIM card, digital mobile 
phones connected to the GSM (Global 
System for Mobile Communications) net- 
work possess an IMEI identifier that is 
associated with the handset itself. (GSM is 
the dominant cellular system used in 


Europe and Asia.) 
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Mobile phone owners are encouraged to 
note down this number so that if the 
handset is stolen, then the operator can 
bar that specific phone from working on 


their network. 


But what about the other networks? The 
handset can theoretically still be used on 
other networks if a suitable SIM can be 
found on the black market. 





The need, therefore, is to find a way to 
completely disable the handset hardware 
on any cellular system — without blowing 
up the phone itself. Thus, the phone is use- 
less to the thief; but it can be reactivated 
and restored if the phone recovered and 


returned to its rightful owner. 


This message was again reinforced in 
January this year with UK ministers consid- 
ering whether to introduce legislation that 
will force networks to introduce the afore- 
mentioned anti-theft measures, but, they 


stipulated, this would be “a last resort.” 


What Is Tomorrow’s Solution? 


The average mobile phone thief is becom- 
ing more discerning, because in the black 
market, the newer “smart phones” are 
becoming the target of choice. To date, the 
growth of mobile communications has 
been fuelled by voice calls, but voice is 
becoming a saturated market, with increas- 
ing competition forcing a decline in aver- 


age revenue per user. 
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To maintain growth and even sustain current 
income and profits, cellular providers must 
introduce new services that will be attractive 
and beneficial to users, who will then need 
new types of technically sophisticated hand- 


sets from the providers’ manufacturers. 


In order to capture the market for replace- 
ment cellular handsets, the next wave of 
mobile phone handsets must be smarter, 
lighter, and last longer on one charge. 
These new smart phones are reinvigorating 
the flagging mobile market by providing 
must-have functionality. The cell phone 
has now moved from being just a humble 
voice communications device to a com- 
bined PDA, Internet access device, MP3 


player, and games console. 


For instance, Nokia recently unveiled its 
latest “lifestyle” function phone, the 5510. 
The phone includes an integrated digital 
music player (with 64 MB memory) for 
ACC (Advanced Audio Coding) and MP3 
files, a full keyboard to facilitate better 
messaging functionality, and a handful of 


embedded games. 


Contrary to recent times, the full cost of 
smart phone will be borne by the customer 
— not subsidized as a free loss leader by a 
wireless service provider, who intends to 
recover the cost of the hardware with inflat- 


ed air-time rates. 


These smart phones are more costly for the 
owner and/or network operator to replace 
if stolen. This cost risk has made the 
mobile phone companies rethink how their 
products are designed. The hardware of 
handset must have the built-in capability to 
be remotely disabled if stolen — but still 
traceable via GPS (global positioning sys- 


tem) or similar geographic locator systems. 


Once recovered, the phones must be able 
to be reactivated to full functionality when 
returned to their owners. A phone with 
blown-up chips isn’t worth recovering. To 
be a viable consumer product, the disabling 
component of a stolen cellular phone must 
be small, lightweight, low cost, low power — 
and capable of fully restoring the function- 


ality of the phone upon recovery. 
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The Xilinx Solution 


With typical forward 
Xilinx has 
already developed the 


thinking, 


tools to combat phone 
theft — IRL technology, 
Comprehensive Security 
Design, and the 
CoolRunner-IT CPLD. 


Bluetooth 
Coxe [0] f= 


If a  CoolRunner-II 
CPLD were used to 
perform the keypad 
interfacing in a handset, 
then if it were stolen, the 
keypad could be dis- 
abled remotely by the 
mobile phone operator 


Xilinx IRL 
technology (Figure 1). 


using 


When returned to its 
owner, the handset 
could then be repro- 
grammed again via an 
IRL bitstream to enable 
the keypad again. (IRL technology enables 
the remote upgrading or programming 


process of CPLD or FPGA hardware over 


any kind of network, including wireless.) 


It is virtually impossible for the 
CoolRunner-II device (Figure 2) to be 
reactivated or for data to be retrieved 
from the handset by the thief or hacker. 
Xilinx Comprehensive Design Security 
provides an unprecedented four aspects of 


design security: 


¢ Prevention of accidental/purposeful over- 
writing or read back of the configuration 


pattern 


¢ Blocking visual or electrical detection of 


the configuration pattern 


e Automatic device lockdown in response 


to electrical or laser tampering 


¢ Physical implementation of the protec- 


tion scheme that is virtually undetectable. 


These design security aspects are not only 
buried within the layers of the device, but 
they are also scattered throughout the die 
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Can be enabled and disabled remotely 


if the handset is reported stolen 


Multiple security mechanisms 
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Figure 1 - Mobile phone block diagram with Xilinx Comprehensive Design Security 


to make their detection impossible. 
Designers can now design with the knowl- 
edge they are using the best design security 
available on any CPLD in the industry. 


The enhanced features of CoolRunner-II 
also elevate the CPLD from an ASIC fix to 


e Different mechanisms disabled 
e Buried interconnects 
e Multiple security signals 
e Scattered and layered 


CoolRunner-i 





Figure 2 - Multiple security functions embedded in 
CoolRunner-II CPLD 
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that of an ASIC replacement. New features 
include clock doubling and clock division 
for lower power consumption, various I/O 
standards support for level translation, 
selectable Schmitt trigger inputs to solve 


signal integrity challenges, and bus hold. 


Conclusion 


By utilizing a combination of IRL technol- 
ogy, Comprehensive Security Design, and 
ultra-low power CoolRunner-II CPLDs to 
remotely disable the hardware within a 
stolen mobile phone, phone operators and 
manufacturers can remove the incentive for 
theft — and improve the odds of recovery 
with geographic locator technology. 


Furthermore, the triple-threat security 
technology of Xilinx-protected products 
can be extended to any consumer appli- 
ance, including laptops, set-top boxes for 
cable TV, PDAs, and other devices with 
wireless or wired network connectivity. As 
the word gets out, consumers will look for 
the Xilinx brand in the products they buy 
— and thieves will look the other way. %& 
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SpartarlE FPGA solutions 
can help you to customize 
your virtual private network 
for highest performance, 
greatest security, and 
lowest cost. 


by Amit Dhir 

Manager, Strategic Solutions 
Optical /Wired Networks 
Xilinx, Inc. 
amit.dhir@xilinx.com 


The worldwide VPN (virtual private 
network) market will reach $32 billion by 
2003, up from $3 billion in 2000, accord- 
ing to Infonetics Research. VPNs provide 
highly secure, temporary, point-to-point 
“tunnels” through the public Internet. 
These tunnels are created on demand and 
are removed when a session ends. VPNs 


offer you the following advantages: 
¢ Lower operating costs 


e Extendable networks to respond to 


changing business demands 


e Anywhere, anytime access to e-mail, 


intranet, and other shared applications. 


Essentially, VPNs are private networks 
deployed over public networks that provide 
similar levels of privacy, security, quality of 
service (QoS), reliability, prioritization, 
end-to-end management, and manageabili- 
ty as local area networks (LANs). 


With Spartan™-IE FPGAs, the right soft- 
ware, and Internet gateways, you get secure 
data encryption and packet authentication 


even over the wide-open public Internet. 


Furthermore, compared to leased line net- 


works, Xilinx-enabled VPNs are: 

e Simpler to set up and easier to administer 
¢ More dynamic and extendable 

e Less expensive to create and deploy 


¢ More accessible anywhere, anytime. 
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How Do VPNs Work? 


VPNs use the band- 
width of public, packet- 
routed networks — typi- 
cally the Internet or a 
service provider's back- 
bone network (Internet 
Protocol, frame relay, 
or AIM). Corporations 
have company | head- 
quarters, remote offices, 
mobile workers, inde- 
pendent contractors, 
and business partners 
connected to a network 
service provider's local 
points of presence 
(POP). Remote users 
and the company’s 
LAN(s) connect to the 
provider's network 
using dial-up, DSL, 
cable, ISDN, T1/T3, 


and wireless. 


The key to VPNs is “tunneling” — the practice 
of repackaging data from one network to 
another. At the originating end of a tunneled 
transmission, a data packet is “wrapped” or 
encapsulated with new header information 
that allows an intermediary network to reor- 
ganize and deliver the packet. At the termi- 
nating end, the terminal protocol “wrapper” 
is removed, and the original packet is deliv- 


ered to the destination. 


Tunneling does not ensure privacy or securi- 
ty — just delivery. To secure a tunneled trans- 
mission against interception and tampering, 
all traffic must be encrypted. Therefore, 
VPNs must include additional functional 
features to enhance transmission security, 
ensure quality of service (QoS), and protect 


the VPN perimeter with a firewall. 


Security and privacy features include tunnel- 
ing itself, data encryption, packet authenti- 
cation, firewalls, and user identification. 
Tunneling uses IPSec (Internet Protocol 
Security), and encryption uses DES (Data 
Encryption Standard), Triple DES, and 
Diffie-Hellman cryptographic algorithms. 


VPN QoS and bandwidth management 


make possible delivery of high transmission 


Summer 2002 


4! 
s 
\ 





Pty 
quality, mission-critical applications (such 
as financial reporting and order process- 
ing), and real-time voice/video applications 
(such as distance learning and videoconfer- 
encing). Packets are tagged with priority 
and time sensitivity markers to allow traffic 
to be routed based on delivery priorities. 
Tagged packets take precedence over 


bandwidth-consuming applications (such 


as Web surfing). 


Network management features simplify 
the addition, deletion, or changing of 
users; software upgrades; and_ policy 
management for security and QoS assur- 
ance. These features make scalability and 


interoperability possible. 


Xilinx Spartan-IIE FPGAs — Providing 
Encryption with Flexibility 


VPN gateways secure Internet-based com- 
munication, perform user identification, 
authenticate data integrity and confidentiali- 
ty, and reject all non-tunneled IPSec traffic. 
Encryption and authentication lie at the 
heart of VPN products, and speeding up the 
processing of encryption algorithms boosts 


the performance and scale of VPN solutions. 


Spartan-IIE FPGAs programmed with 


proprietary and/or standard encryption 


i) 


cores provide flexibility 
without compromising 
cost or performance — 
and reducing time to 


market. 


Implementing encryp- 
tion in programmable 
hardware _ significantly 
improves performance 
over software solutions. 
By using parallel com- 
puting, FPGAs encode 
and decode larger trans- 
mission blocks more 
effectively. The encrypt- 
ed key can be changed 
within the FPGA fabric, 
and if a key is ever 
broken, the Xilinx 
Spartan-IIE solution can 
be reconfigured instantly 


| with new algorithms. 


Spartan-IIE FPGAs give you the power to 
to choose and optimize just the right fea- 
ture set and intellectual property cores you 
need to implement your designs. You can 
integrate functionality, such as PCI, mem- 
ory controllers, and other components, 
within the Spartan-HE device to customize 
your product and reduce costs. This flexi- 
bility comes at a significantly lower cost 
compared to ASSPs (application specific 
standard products). 


Conclusion 


VPNs improve the productivity of remote 
workers and hence, their organizations. 
VPNs promote flexible work styles, 
extend workplaces beyond office walls, 
connect remote offices with the headquar- 
ters, and foster competitive advantages 
with strategic partners-— all at reduced 


costs, compared to. other options. 


Using the Internet and/or service provider 
backbones to transfer confidential data 
requires state-of-the-art encryption and 
privacy solutions. Spartan-IIE FPGA- 
based encryption and security give you the 
scalability and flexibility you need to 
implement the perfect virtual private net- 


work for your enterprise. & 
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Perspective Bluetooth Security 


Leveraging Bluetooth Technology 
to Build Secure and Robust 
Wireless Communications / 
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Programmable logic is used to design wireless products 
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by Mamoon Hamid, Manager 

Strategic Solutions, Consumer/Home Networking 
livemlie 

mamoon.hamid@xilinx.com 


Although Bluetooth™ wireless technology 
promises to be a ubiquitous, low cost solu- 
tion — perfect for a very wide range of data 
communication — there are two major 
areas that further need to be addressed: 
data security and data integrity. These two 
areas will serve to limit range of applica- 


tions where Bluetooth can be deployed. 
Bluetooth Data Security 


The Bluetooth specification has defined a 
set of security mechanisms to provide a very 
basic level of protection for short-range 
Wat Cncmmecyestooltrante-tu Coys or-lol sms) (tlacetelaal 
device is required to implement key man- 
agement, authentication, and encryption. 
The Bluetooth Generic Access Profile 
(GAP) divides security into three different 
modes. In addition, Bluetooth technology 
implements a frequency-hopping scheme 
that also serves as an additional security 


measure against eavesdr 0) 0) TS ee 
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Key Management 


Bluetooth security uses several different 
keys in higher layer software to ensure 
secure transmissions. Figure 1 shows a 


block diagram for key management. 
Device Authentication 


Authentication is a critical security com- 
ponent of Bluetooth systems, allowing a 
domain of trust between an ad-hoc net- 
work of Bluetooth devices. Bluetooth 
authentication is based on a challenge- 
response scheme that secures the connec- 
tion between two devices. When a device 
is not authenticated, a period of time must 
pass before a new attempt for authentica- 


tion can be made. 
Encryption 


Bluetooth Encryption is used to ensure 
the privacy of a connection. Prior to 
encryption, an authenticated link must 
be in place. Encryption scrambles the 
data payload of Bluetooth packets using 
an 8-bit to 128-bit key, however, the 
Bluetooth access code and the packet 
header are never encrypted. Payload 
encryption depends on the strength of 
encryption desired and geographical reg- 
ulations in the countries in which the 


product is being deployed. 


Physical Layer Data Security — Frequency 
Hopping Spread Spectrum 


In addition to the other security features 
built into Bluetooth equipment, the fre- 
quency-hopping scheme employed for 
Bluetooth communication (Frequency 
Hopping Spread Spectrum or FHSS) also 
serves as mechanism to make eavesdrop- 


ping extremely difficult. 


The Bluetooth radio operates in the 2.4 GHz 
ISM band. In North America and most of 
Europe, Bluetooth radio uses a frequency 
range from 2.402 GHz to 2.480 GHz, 
with this band divided into 79, 1-MHz 
subchannels. In frequency hopping, a 
data signal is modulated with a narrow- 
band carrier signal that hops from fre- 


quency to frequency as a function of time. 
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Figure 1 - Bluetooth key management 


Bluetooth radio employs a hopping 
sequence of 1600 hops per second. 


FHSS relies on frequency diversity to com- 
bat interference. If the radio encounters 
interference on one frequency, the radio 
will retransmit the signal on a subsequent 
hop on another frequency. The aggregate 
interference will be very low, resulting in 


little or no bit errors. 


Bluetooth Data Integrity — Forward 
Error Correction (FEC) 


Forward error correction reduces the num- 
ber of retransmissions for a data payload. 
However, the drawback is that it can 
tremendously reduce the achievable 
throughput. There are three types of for- 


ward error correction that are implemented 


by Bluetooth technology: 
¢ 1/3 rate FEC — Each header bit is repeat- 


ed three times. It is primarily used for 
masking errors in the header, which con- 


tains vital link information. 


e 2/3 rate FEC — Every ten information 
bits are encoded into a 15-bit code word. 
This type of error correction is primarily 
used for sending data packets in the least 


vulnerable way. 


e Automatic Repeat reQuest (ARQ) — 
Used to transmit and retransmit data 
packets, among other things, until the 
reception of those packets is acknowl- 
edged from the recipient. The recipient 
returns the acknowledgement in the 
header sent back to the sender of the 


initial packet. 
Data Whitening 


All the header and payload information is 
scrambled with data whitening bits before 
transmission occurs, so that redundant 0’s 
or l’s are eliminated from the transmis- 
sions. This can be a problem with analog 
data signals received by the baseband 
processor because they need to be classified 
as 0 or 1. Because there is no reference 
point such as DC for these signals, you 
must rely on the last few signal transmis- 
sions for calibration purposes. Any long 
sequential string of 0’s or 1’s may cause cal- 
ibration to fail. Hence data whitening is 
used to scramble these signals so that the 
probability of long strings of 0’s or 1’s is 


significantly reduced. 
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Programmable Solutions for Advanced 
Data Security 


Bluetooth data security is inadequate for 
most applications where a high level of con- 
fidentiality is of the utmost importance. It 
seems to be adequate for smaller applica- 
tions, but any sensitive or otherwise prob- 
lematic data should not be transmitted via 
Bluetooth. For example, the encryption 
scheme used in Bluetooth shows some 
weaknesses. In one possible scenario, an 
attacker can obtain the encryption key that 
secures Communication between two devices 
and can then eavesdrop on their messages. 
The attacker also could claim to be one of 
the devices and insert false messages. One 
way to avoid this, Lucent Technologies says, 
is for the users to employ lengthy personal 
identification numbers to increase the diffi- 


culty of discovering the encryption keys. 
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Figure 2 - X_DES cryptoprocessor block 
diagram (courtesy of: InSilicon Corp.) 


This again involves manually entering per- 
sonal identification numbers (PINs). This 
can quickly become a nuisance; it is a tedious 


process to secure the connection. 


Another way to overcome these security issues 
would be to use a more robust encryption 
algorithm, such as Data Encryption Standard 
(DES), or even Triple DES, or Advanced 
Encryption Standard (AES), instead of the 
stream cipher EO algorithm. DES is a block 
cipher, which essentially means that the 
cipher encrypts blocks of data all at once and 
then proceeds to the next block. In DES, the 
original message is divided into fixed length 
blocks of 64 bits, enciphered using a 56-bit 
secret key into a 64-bit encrypted message by 
means of permutation and substitution. A 


block diagram of DES is shown in Figure 2. 


Unlike the Bluetooth stream cipher algo- 


rithm, it has been mathematically proved 





that the block ciphers are completely 
secure. The DES block cipher is highly 
random, nonlinear, and produces encrypt- 
ed text that functionally depends on every 
bit of the original message and the key. 
The DES has more than 72 quadrillion 
(72 x 1015) possible encryption keys that 
can be used. For each given message, the 
key is chosen at random, from among this 
enormous number of keys. The DES algo- 
rithm is widely used and is considered very 
dependable, however, an even more secure 
variant, Triple DES, which applies DES 
three times with different keys, in succes- 


sion is also available. 


Both of these encryption algorithms — 
DES and Triple DES — can be imple- 
mented using low-cost programmable 
logic devices and readily available intel- 
lectual property (IP) for a higher level of 
encryption. Today, you can buy 100,000 
system gate Spartan™ FPGAs in volume, 
off the shelf, and ready to go, for just $10 
per unit. These FPGAs also allow addi- 
tional functionality, such as advanced 
error correction, to be added to a design 
as well. Hence, these devices enable dras- 


tic system-level cost reductions. 


Another encryption algorithm that is gain- 
ing rapid acceptance (due its superior 
robustness and optimal implementation in 
hardware and software) is AES, also called 
Rijndael. AES is a 128-bit block cipher 
with selectable 128, 192, or 256-bit key 
lengths. AES encryption and decryption 
blocks can be implemented individually 
or as a whole in programmable logic. In 
addition, the key generator can be para- 
meterized based on key length. A pro- 
grammable logic-based implementation is 
modular and also allows you to choose a 
hardware implementation based on the 
level of security desired. Shown in Figure 
3 is a block diagram of AES. The cost on 
a silicon basis for this encryption core 
with a 128-bit key generator is less 
than $2 today. Variations can be imple- 
mented depending on the end application 


security requirements. 
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Software-based encryption solutions offer 
high flexibility, but low performance. 
Hardware-based encryption solutions pro- 
vide high performance, yet provide very 
little flexibility once designed. An encryp- 
tion solution based on programmable 
logic provides the best of both worlds — 
you get high levels of flexibility as well as 


high performance. 


A more robust encryption algorithm will 
allow Bluetooth technology to be safely 
deployed in a wide range of applications 


where security is of the utmost importance. 


This includes: 


e Electronic financial transactions 


such as AIM or Smart Cards 
e Secure e-commerce transactions 
¢ Secure office communications 
e Secure video surveillance system 
¢ Digital set-top boxes 
e HDTV 


¢ Other consumer electronics devices. 
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Programmable Solutions for Advanced 
Data Integrity 


The Bluetooth specification defines meas- 
ures that are effective for preventing unin- 
tentional errors from being transmitted 
using data whitening and error checking. 
However, a particular stream of data, which 
would consistently cause transmission 
errors, is theoretically possible, but highly 


unlikely in normal transactions. 


There is still a risk of error-prone transmis- 
sions in more hostile environments, such 
as industrial locations, office buildings, 
airports, and metropolitan public trans- 
portation. In locations like these, noise and 
interference from other multiple Bluetooth 
devices, wireless networks, phone systems, 
or other electronic devices operating on the 
same frequency band can cause problems. 
Again, programmable logic and advanced 
error-correction IP can be used to facilitate 
error-free Communication. A robust for- 
ward error correction technique, such as 
turbo convolution coding, is an alternative 


for a hostile communications environment. 


_ encrypt status. 


round keys 


Helion AES 
Roundkey Core 


Figure 3 - AES encryption core (Courtesy of: Helion Technology) 
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Turbo coding is an advanced forward 
error correction algorithm. It is a stan- 
dard component in third-generation 
(3G) wireless communication systems, 
like those employing wideband code 
division multiple access. The principle of 
turbo coding is that the encoder gener- 
ates a data stream consisting of two inde- 
pendently encoded data streams and a 
single un-encoded data stream. The two 
parity streams are weakly correlated due 
to the interleaving. In the turbo decoder, 
the two parity streams are separately 
decoded with soft decision outputs, 


referred to as “extrinsic” information. 


The strength of the turbo decoding 
results from the sharing of the extrinsic 
information during a number of itera- 
tions. [he extrinsic information is passed 
from one parity decoding step to the 
other, from one iteration to the other. IP 
for such a turbo convolutional codec is 
readily available and is easily implement- 
ed on low cost programmable logic 
devices such as the Xilinx Spartan series. 
This solution can ensure the integrity of 
Bluetooth data transmissions in hostile 


and error-prone environments. 
Conclusion 


Bluetooth wireless technology is chang- 
ing the way we communicate. However, 
it still faces challenges on the level of data 
security and integrity for secure applica- 
tions. Low-cost programmable logic 
devices are ideally suited for advanced 
data security and integrity implementa- 
tion. This will be a necessity for those 
who wish to implement higher levels of 
security into applications where security 
is of the utmost importance. Also, appli- 
cations that will be in operation in hos- 
tile environments will benefit from a cus- 
tomizable programmable logic imple- 
mentation. Programmable logic is an 
excellent solution for integrating custom 
Bluetooth implementations, such as 


these, into embedded systems. %& 
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Home Networking 


. by Amit Dhir, Manager 
Strategic Solutions, Optical /Wired Networks 
IEEE 1394 ond HAVE = 
amit.dhir@xilinx.com 


@ The consumer world is buzzing with news 

and standards to network home appliances, 

re T Q le (] | I) ( PCs, peripherals, and other consumer 
devices. A number of technologies are 


vying to be the next technology to network 


© @ 
all appliances in the home. Some of these 
A I) () () les Of ( 22 technologies include HomeRF™, 
Blyctooth!*) yeliperlaAN2, IEEE 802.11; 
Ethernet, HomePNA™, USB 2.0, and 


@ 
Although each technology presents unique 


pros and cons, the technologies that require 


wires do provide one critical capability — 


Home networking promises to be one of the hottest new markets Hi abil, to GEMM liable hish-apeed 


in consumer electronics — and the combination of IEEE 1394 voice, data, and video transfer. Home net- 
and HAVi technologies show the most promise to become the Ong Je incor ithour the auuity 
standard for home networking. 


to transfer voice, data, and video together. 


Ethernet, USB 1.1/2.0, and IEEE 1394 top 


the list of these “new wires” technologies: 


e Ethernet is a low-cost technology, which 


uses a protocol ideal for data traffic. 


¢ USB 1.1 and its more recent specification 
USB 2.0 are primarily PC-centric — net- 
working PCs and PC peripherals. 


e IEEE 1394 is the technology that holds 
the most promise to be the perfect home 
networking technology that will network 


audio, video, and PC equipment. 
IEEE 1394 Backgrounder 


The move from analog to digital technology 
has enabled the sharing of video, Internet, 
data, and audio. Consumers are constantly 
searching for faster, cheaper, more reli- 
able, and easier ways of transferring and 
sharing information. While information 
sharing has been happening at the enter- 
prise level for the last few years, the home 
is just seeing its introduction. This phe- 
nomenon, known as home networking, 
allows the interconnection of PCs, con- 
sumer equipment, and communications, 
and the distribution of infotainment 


among these appliances. 
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Configuration & 
Error Control 


Serial Bus 
Management 


Figure I - IEEE 1394 protocol stack 


The convergence of voice, data, and video 
in the home will happen only when seam- 
less, high-speed communication becomes 
readily available. IEEE 1394 is one such 
interconnection technology that will 
enable connection of these devices 
throughout the home. IEEE 1394, also 
known as 1394, FireWire™ or iLink™, is 
a versatile, high-speed, and inexpensive 
method of interconnecting a variety of con- 
sumer electronic devices (such as digital 
TV, set-top boxes, and home theater equip- 
ment), PCs, and PC peripherals (such as 


scanners and printers). 


The FireWire bus standard, originally cre- 
ated by Apple Computer, was born out of 
the need for a low cost, consumer-oriented 
connection between digital-video 
recorders and PCs. FireWire technology 
grew into a standard — called IEEE 1394 — 
for low cost, high data rate connections. In 
1994, the 1394 Trade Association (1394ta) 
was formed to support and promote the 


adoption of the IEEE 1394 standard. In 
1995, the 1394ta formally released the 
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1394 specification. Further revisions like 
1394a in 1998 and 1394b in 1999, were 
introduced. The 1394b standard is fully 
backward compatible with the current 
1394 and 1394a specifications. Each revi- 
sion of 1394 has added features, perform- 


ance, and capabilities. 
IEEE 1394 Architecture 
The components that form a 1394-based 


home network include the actual protocol 
itself, the cabling system, and the architec- 
tural design of the network. Similar to 
other high-speed networking systems, 
IEEE 1394 adopts a layered approach to 
transmitting data across a physical medi- 
um. The four layers used by the IEEE 1394 


protocol are shown in Figure 1. 
Physical Layer 


The physical layer provides the electrical 
and mechanical connection between the 
1394 appliance (connector) and the cable 
itself. Besides the actual data transmission 
and reception tasks, the physical layer also 


provides arbitration to insure all devices 


Home Networking 


have fair access to the bus. As its physical 
media, 1394 requires optical fiber or high- 
grade copper wiring between the appli- 
ances. [he 1394 physical layer is physically 
point-to-point and logically a bus (each 


node is a repeater). 


The physical layer transmits the unstruc- 
tured, raw, bit stream over a physical medi- 
um, and describes the electrical, mechani- 
cal, and functional interface to the carrier. 
It provides the linking to the upper sessions 
via signaling and the initialization and arbi- 
tration services necessary to assure that 
only one node at a time is sending data. 
The physical layer of the 1394 protocol 
includes electrical signaling, mechanical 
connectors and cabling, arbitration mecha- 
nisms, serial coding and decoding of the 
data being transferred or received, and 


transfer speed detection. 
Link Layer 


The link layer takes the raw data from the 
physical layer and formats it into two 
types of recognizable 1394 packets — 


asynchronous and isochronous. 


Asynchronous data transfer is the conven- 
tional transmit-acknowledgment protocol 
and puts the emphasis on guaranteed 
delivery of data, with less emphasis on 


guaranteed timing. 


Isochronous data transfer is a real-time 
guaranteed-bandwidth protocol for just-in- 
time delivery of information. It puts the 
emphasis on the guaranteed timing of the 
data and less emphasis on delivery. 
Isochronous transfers are always broadcast 
in a one-to-one or one-to-many fashion. 
No error correction or retransmission is 


available for isochronous transfers. 


Transaction Layer 


The third layer in the IEEE 1394 protocol 
is called the transaction layer and is respon- 
sible for managing the commands that are 
executed across the home network. It sup- 
ports the asynchronous protocol write, 
read, and lock commands. A write sends 
data from the originator to the receiver, and 


a read returns the data to the originator. 
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Serial Bus Management 


The fourth and final logical grouping of 
functions is responsible for the overall con- 
figuration control of the serial bus. It con- 
trols the serial bus in the form of optimiz- 
ing arbitration timing, guarantee of ade- 
quate electrical power for all devices on the 
bus, assignment of which 1394 device is 
the cycle master, assignment of isochronous 
channel ID, and basic notification of 
errors. The bus management is built upon 


IEEE 1212 standard register architecture. 
Benefits of IEEE 1394 


¢ Broad industry and standards bodies sup- 
port 


¢ Low cost 

¢ High and scalable speeds 
¢ Plug and play 

¢ Nonproprietary. 


IEEE 1394 is an enabling technology for 


connecting multimedia devices, such as: 
¢ Digital camcorders and VCRs 

e Satellite modems 

¢ Set-top boxes 

° Digital TV 

e PCs 

¢ DVD players 

¢ Gaming consoles 

¢ Home theater 


¢ Musical synthesizers/samplers with digi- 


tal audio capabilities 
— Digital audio tape (DAT) recorders 
— Mixers 
— Hard-disk recorders 
e Video editors. 
HAVi Backgrounder 
HAVi (Home Audio/Video interoperability) 


is an industry initiative started by Sony and 
Philips in 1996. Since then six other com- 


panies have joined — Thomson, Hitachi, 


Toshiba, Matsushita, Sharp, and Grundig. 
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HAVi adopted the IEEE 1394 bus stan- 
dard as the underlying network technology 
for the HAVi protocols and for the trans- 
port of real-time audio/video (A/V) 
streams. Using 1394 as the bus standard 


(shown in Figure 2) provides benefits, such 
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dor. In the upcoming world of digital tech- 
nologies, HAVi extends this networking 
model by allowing communication among 
consumer electronic appliances from mul- 
tiple brands in the home. The HAVi mid- 


dleware architecture specifies a set of APIs 





IEEE 1394 PHY 


Figure 2 - Simplistic figure showing HAVi using IEEE 1394 bus standard 


as high-speed, flexible connectivity, and the 
ability to link up to 63 appliances together. 
No other interconnection technologies, 
such as wireless, HomePlug™, HomePNA, 
and USB, are capable of distributing high- 


speed video applications. 


The HAVi middleware architecture is an 
open (nonproprietary), lightweight, and 
platform-independent specification that 
allows development of home networking 
applications. It does not, however, address 
home networking functions such as con- 
trolling the lights or monitoring the cli- 
mate within the house. HAVi specifically 
focuses on the transfer of digital A/V con- 
tent between in-home digital appliances, 
as well as the processing (rendering, 
recording, and playback) of this content 
by HAVi-enabled appliances. 


The HAVi middleware system is independ- 
ent of any particular operating system or 
CPU and can be implemented on a range of 
hardware platforms, including digital prod- 
ucts such as cable modems, set-top boxes, 
integrated TVs, Internet TVs, or intelligent 


storage appliances for A/V content. 


Today in the world of analog consumer 
electronic appliances, there exist a number 
of proprietary solutions for interoperability 


among appliances from one brand or ven- 


(application programming interfaces) that 
allow consumer electronic manufacturers 
and software engineering companies to 
develop applications for IEEE 1394-based 


home networks. 


One of the main reasons HAVi selected 
IEEE 1394 over other transmission proto- 
cols is because of its support for isochro- 
nous communications. HAVi comprises 
software elements that facilitate the interop- 
erability among different brands of enter- 
tainment appliances within the house. 
Interoperability is an industry term that 
refers to the ability of an application run- 
ning on an in-home appliance to detect and 
use the functionality of other appliances 


that are connected to the home network. 


The underlying structure for a home net- 
work based on HAVi technologies is a 
peer-to-peer network, where all appliances 
can talk to and interact with each other. 
HAVi has been designed to allow the 
incremental addition of new appliances, 
which will most likely result in a number 
of interconnected clusters of appliances. 
Typically, there will be several clusters of 
networks in the home, with one per floor 
or per room. Over time these clusters will 
be connected with technologies such as 


1394 long or wireless 1394. 
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Benefits of HAVi 


A HAVi-compliant appliance offers a num- 


ber of advantages, which include: 
e Automatic detection 


¢ Automatic registration of added 


appliance to the HAVi network 
e Automatic software upgrades 
¢ Manageability 
¢ Brand independence 
¢ Hot plug-and-play 
¢ Legacy appliance support. 


Xilinx Programmable Solutions Enable 
IEEE 1394 /HAVi Products 


The IEEE 1394 technology consists of a 
physical layer for encoding-decoding, 
arbitration, a medium interface, and pro- 
vides an electrical signal and mechanical 
interface. The link layer provides cycle 
control, packet transmits, packet receives, 
CRC, and provides the host and applica- 


tion interface. 





Figure 3 - The IEEE 1394 physical layer 


and link layer controller with a host interface 
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As shown in Figure 3, programmable logic 
solutions provide complete link layer func- 
tionality with the ability to connect to mul- 
tiple interfaces such as PCI, USB, and pro- 
prietary audio-video buses. The advantage 
of programmability is realized when there 
is a proprietary application interface. 
However, in an IEEE 1394 system, the 
Spartan!™-II FPGA provides system inter- 
face and other ASSP functionalities. 


With the 1394 specification still continu- 
ing to evolve, having the link layer con- 
troller programmed in a FPGA provides 
the ability to reprogram the FPGA with the 


latest 1394 revision. 


Supporting different products requires the 
support and interface to different interfaces. 
For example, using 1394 in a PC requires 
an interface to PCI, PCMCIA, and other 
proprietary interfaces. Programmable solu- 
tions are ideal for providing this interface, 
because developing ASSPs for these applica- 
tions is relatively expensive. Also, the 
decreasing cost of programmable logic solu- 
tions makes them ideal for IEEE 1394 and 
HAVi-based products. 
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Conclusion 


The digital home continues to evolve and 
smarter appliances continue to populate 
the home. These smarter appliances and 
the need for sharing broadband data, 
voice, and video are pushing the need for 
home networking. Although several tech- 
nologies exist, the technology that pro- 
vides high-speed and reliable delivery of 


voice, data, and video will win. 


IEEE 1394 is one such home networking 
technology that provides both high-speed 
and reliable delivery. The proliferation of 
1394 as the A/V standard will be acceler- 
ated through the use of HAVi as its mid- 
dleware solution to connect disparate 
devices, thus providing a complete solu- 


tion to the consumer. 


However, the specification continues to 
evolve and 1394 products need to coexist 
with other home networking and system 
interface technologies, thus requiring prod- 
ucts that interface between these technolo- 
gies. With the continuing reduction in prices 
of programmable logic solutions, these are 
ideal solutions to provide interfaces between 
1394 and other technologies such as PCI, 
USB, PCI-X, SCSI, and HomePNA. %& 
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You Can Take It with You: 
On the Road with Xilinx 


by Karen Parnell, Manager 
Automotive Product Marketing 
Xilinx, Inc. 
karen.parnell@xilinx.com 


Today’s consumers demand the comforts of 
home and the productivity of the office 
when they drive. They also want the latest in 
information and safety systems for their cars. 
The resulting convergence requires interface 
standards and protocols to interconnect and 
interoperate. These standards and protocols 
are still emerging, and will get even more 
complicated before being standardized. The 
result is a manufacturing dilemma. Should 
manufacturers make an educated guess now 
as to which standard will prevail, and risk 
making the wrong choice? Or should they 
wait for the standards to be fixed, and risk 
getting left behind? 
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Xilinx has the answer. With Xilinx IQ pro- 
grammable logic solutions for automotive 
applications, manufacturers can win the 
time-to-market battle risk free by using the 
ability to reconfigure their products to 
accommodate any standard — past, present, 
or future. Reconfigurable logic in produc- 
tion allows manufacturers to reconfigure 
the units in-car to implement new hard- 
ware-based features. Xilinx high-volume 
IQ FPGA and CPLD devices are cost- 
effective solutions that retain the tradition- 
al PLD time to market advantage. As a 
result, many leading telematics and info- 
tainment product manufacturers use 
Xilinx programmable logic devices. These 
manufacturers recognize the added flexi- 
bility and time to market benefits that pro- 
grammable logic solutions provide. 


products and technology are putting office technology 
nd functionality into next-generation automobiles. 


Driving Your Office 


Information Age professionals need to be 
able to work from wherever they happen to 
be — from home, from hotel rooms, from 
airports, from 36,000 feet over the Atlantic. 
They need to take their offices with them. 


The laptop computer was a critical step in 
the development of offices-on-the-go, but 
now workers are demanding even more. 
One result has been the in-car office. The 
automobile is no longer just a way to get 
from Point A to Point B. Now it can include 
GPS navigation with intelligent route find- 
ing, integrated PDA functions, built-in 
mobile communications, entertainment sys- 
tems, and even in-dash personal computers. 


Automotive electronic designers face the 
same challenges as designers of consumer 
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Multimedia Platform Design Approach 





Prototyping 


e Architecture choice 

¢ Components 

e Prototyping 

e Technology evaluation : 
services 
© Pretesting 


© Generic software services 


© Changing specifications 
and protocols 


Production 


e Freeze specifications 


e Single platform, multiple 
manufacturing variations 


¢ Design customer-specific 


e Printed circuit boards 
© Configuration 


© Mechanics 





Application 


e Flexible and 
field-upgradeable 
after the sale 





e Customizable look and feel 


e Add new features 
and functions 


Figure 1 - Automotive multimedia platform design approach 


equipment. They must grapple with time to 
market pressures, cost restrictions, and a 
profusion of new standards and protocols. 
They also face added challenges like a con- 
strained design area, and the need for a sim- 
ple user interface so drivers can keep their 
eyes on the road. 


As one example of the new technology, 
the latest BMW 7 Series automobiles 
allow the driver to operate the car’s heat- 
ing and air conditioning, entertainment 
systems, wireless communications 
devices, navigation instruments, and PC 
applications, all on a single display, by 


means of a joy-stick type of device. 


Xilinx programmable logic devices enable 
automotive designers to provide their cus- 
tomers with a more informative, productive, 
safe, and entertaining in-car environment. 
By using the flexibility, time to market 
advantage, and after-sale reconfigurability of 
Xilinx devices, manufacturers can be first to 
market and best in class long after the sale. 


The Automotive Multimedia Platform 


As more information and entertainment 
systems are added to automobiles, there's 
an inevitable conflict among the bewilder- 
ing array of standards and protocols being 
tested, including Bluetooth™, BlueCAN, 
MOST (media-oriented system transport), 
FireWire™, CAN (controller area net- 
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work), TTP (time triggered protocol), and 


Flex Ray™ technologies. 


Designers of in-car multimedia systems 
may soon have to provide for traffic infor- 
mation systems, Internet and Web access, 
electronic game consoles, MPEG music 
download capability, digital radio recep- 
tion, and mobile commerce services. 


Designers must ensure that the car's multi- 
media system can communicate with the 
automobile’s other devices. For example, the 
car should automatically detect a new mobile 
phone and connect it seamlessly to the car's 
communications network. This demand for 
automatic connectivity may soon extend to 





Figure 2 - Acunia automotive development 
platform with Xilinx Spartan-II FPGA and 
CoolRunner™ CPLD onboard 





PDAs, portable PCs, MP3 players, and other 


personal portable electronic equipment. 


As in-car and consumer functions converge — 
and new automotive and consumer stan- 
dards and protocols emerge — designers have 
begun to prototype multimedia platforms 
that can provide as much, or as little, func- 
tionality as required. The best way to create 
such multimedia platforms is to design 
reconfigurable hardware. Engineers can pro- 
gram reconfigurable hardware late in the 
production flow to provide custom function- 
ality on a standard hardware platform. At the 
same time, they can configure the hardware 
to accommodate new standards. 


Figure 1 shows the automotive multimedia 
platform design approach. This concept 
allows upgrades throughout the life of the 
car. Designers can implement these 
upgrades remotely by means of wireless 
communications and Internet connectivity. 
Xilinx Internet 


(IRL™) technology makes this remote 


Reconfigurable Logic 


upgrading process possible. 


With Xilinx IRL technology, designers can 
upgrade, modify, and fix systems long after 
the car leaves the dealership. For example, an 
engineer could remotely add a new MP3 
player to an in-car multimedia system, or 
upgrade the system with the latest protocol. 
This same technology can even disable the 
multimedia unit if it is stolen, then re-enable 
it when it is returned to the rightful owner. 


Multimedia System Design Flow Using FPGAs 
The multimedia platform should ideally be 


based on one Human Machine Interface 
(HMI). This HMI allows the user to access 
all functions by means of a touch screen. If 
the functions are implemented in software 
and reconfigurable hardware, the manufac- 
turer or dealer can upgrade the system even 
after the buyer has driven the car away. 


The new way of developing in-car systems is 
to prototype with FPGAs in a generic devel- 
opment environment. The designer can 
develop the elements quickly and easily with- 
out fixing specifications. This initial prototyp- 
ing phase can be realized using Virtex™-II 
Platform FPGAs. At this early stage, the 
designer can try, test, and debug the different 
standards, protocols, and functions by using 


the headroom provided by a large FPGA. 
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As the design firms up, the specifications are 
frozen. The designer can then select the spe- 
cific standard, protocol, or function from the 
many tested. This move from prototype to 
production enables the designer to optimize 
the design and fit it into smaller, lower cost 
FPGAs, such as Spartan™-IIE programmable 
logic devices. This migration from Virtex-II™ 
to Spartan-IIE FPGAs still allows for future 
system upgradability via IRL connectivity. 


Once the device is in production, the engi- 
neer can use the FPGA as an aid in total 
printed circuit board testing with JTAG 


techniques. If necessary, the engineer can 


also tweak and enhance the design even at 
this late stage. 


Recognizing the need for intelligent proto- 
typing platforms for use by in-car multi- 
media designers, leading next-generation 
telematics technology providers such as 
Acunia have produced the Xingu™ tele- 
matics platform shown in Figure 2. 


Conclusion 


The final stage is the look and feel of the 
product. Each car manufacturer can cus- 
tomize the product to fit into a specific 


dashboard (or fascia). All of the produc- 
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tion multimedia units are built up around 
the standard FPGA-based platform. The 
designer can program this standard plat- 
form with its personality late in the pro- 
duction flow to accommodate last-minute 
design changes or end-user preferences. 


For more information, visit these websites: 


Xilinx Automotive IQ Solutions: 
www.xilinx.com/automotive/ 


Acunia: www.acunia.com 


BMW 7 Series Sedan: 
www.bmw.com/bmwelproducts/ 


automobiles/7er/sedan/ ¥: 


Xilinx 1Q Solutions — Creating Automotive Intelligence 


To address the needs of automotive telematics designers, Xilinx has created a new family of devices with an extended indus- 


trial temperature range option. This new “IQ” family consists of existing Xilinx industrial grade (I) FPGAs and CPLDs with 


the addition of a new extended temperature grade (Q), available for selected devices. The new IQ product grade (-40°C to 


+125°C ambient for CPLDs and junction 
for FPGAs) is ideal for automotive and 
industrial applications. See Table 1. 


The wide range of device density and pack- 
age combinations enable you to deliver cost 
effective, high performance, flexible solu- 
tions to meet your most extreme application 


needs. See Table 2. 


Temperature Grade/Range °C 


FPGA | Ty= 010485 | Ty=-4010+100 | Ty= 40104125 
(PLD Ty = 0 to +70 Ty= 4010485 | Ty =-40t0 4125 


Table 1 - IQ temperature range 


With Xilinx IQ devices, you can design flexi- 


bility into your application and get your product to market faster than 
ever before. Because many new standards are evolving — such as the 
MOST (media-oriented system transport) and FlexRay™ in-car 
bussing protocols — you need the flexibility to quickly modify your 


Device Family 


NOW - XCSOSXL, XCSTOXL 
TR XCS2OXL, XCS3OXL, XCS40XL 





Availability 


designs at any time. With our Internet Reconfigurable Logic (IRL™) 
capability, you can remotely and automatically modify your designs, in 
the field, after your product has left the factory, and even after the sale. 


Q Grade Ordering Information 


XC9500XL (3.3V) 


NOW - XC9536XL, XC9572XL 


CoolRunner XPLA3 (3.3V) 
Spartan-Il (2.5V) 
CoolRunner-Il (1.8V) 
Spartan-llE (1.8V) 


Macrocell Count = XC9500XL Speed Grade 
CoolRunner XPLA3 (ns for CPLDs 
CoolRunner-I| | | 


) 
(Ambient for CPLDs) 
| (Junction for FPGAs) 


XCxxxXL — 5yyyQ 


Temperature Range 
Q = -40°C to +125°C 





Table 2 - Device availability schedule 


By combining our latest IQ programmable logic devices 


with our solutions infrastructure of high productivity 


software, IP cores, design services, and customer educa- 
tion, you can develop advanced, highly flexible prod- 
ucts, faster than ever before. 
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XCR = CoolRunner XPLA3 XL=3.3M Package Type 
XC95 = XC9500 CPLDs EN) VQ = VQFP 
XC2S = Spartan-ll TOS 1GEP 
XCS = Spartan-XL POE IPOER 
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FG = Fine Pitch BGA 
FT = Fine Pitch Thin BGA 


For more information, full product selector 
guide, data sheets, white paper, and more, go to 
www.xilinx.com/automotivel/. 
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by Jim Hwang, Senior Manager 


@ 
DSP Software and Design Methodologies 
Xilinx, Inc. 
jim. hwang@xilinx.com 


Platform FPGAs have become key compo- 


nents for implementing high-performance 


® td 
digital signal processing (DSP) systems, 
(] { | ( | QC { especially in digital communications, 
video, and image processing applications. 


defi _ FPGAs have memory bandwidth that far 
© ana @ . ! exceeds that of microprocessors and DSP 
A 7 . processors running at clock rates two to ten 
rr ; times faster, and unlike processors, 
ror f Platform FPGAs possess the ability to 
Fi implement highly parallel custom signal 
Systel | | > ile has been the relative unfamiliarity with 
ynaat = 
Phe = a pps 
the language of signal processing engineers 
Xilinx System Generator 2.2 software encbles hgh Naltanadali Bole, MATLAB™ lanpuase and 


| FPGA technology within the DSP com- 

+. = re : = munity. What is needed are tools that speak 

and implementation of Nsw ste nhs Ny ) inx f atform Eh XS (0) Simulink™ design tools from The 
outperform hadi ondl pie CeSSOIS. 





processing architectures. 


One of the main impediments to wider 


adoption of FPGAs for signal processing 






MathWorks, Inc. — while also supporting 
FPGA designers who already know how to 
achieve the highest performance circuit 


using the least number of FPGA resources. 


7 The new Xilinx System Generator 2.2 soft- 
ware meets the needs of both DSP engineers 
, in and FPGA designers. System Generator 
— enables high-level modeling and implemen- 
tation of DSP systems in the Virtex-II 


ero ee artex?™-[1, Virtex, and Spartan!™- 


II families of FPGAs. New in the 2.2 release 

is an increased capability to generate hard- 

) a ware that approaches the efficiency of hand- 
RILIN crafted modules, both in terms of perform- 
< SY | TeV ance and resource usage. In this article, we 
i describe how System Generator can be used 
= | G Ee N E r ATO = ™ to create a custom FIR (finite impulse 
response) filter, an operation that lies at the 


heart of most signal processing systems. We 








2 : 


p’,! demonstrate that in addition to accessing 


high-level Xilinx LogiCORE™ modules, 
you can use your own FPGA knowledge to 





advantage in customizing a filter data path to 
minimize resources while achieving high per- 
formance. You will see how System 
Generator provides a wide range of options 


for implementing signal processing systems. 
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Xilinx System Generator 


The Xilinx System Generator is a state-of- 
the-art “on-ramp” that allows you to move 
a DSP algorithm into a Xilinx FPGA 
quickly and easily. System Generator 
extends the capabilities of the Simulink sys- 
tem-level simulation environment with bit 
and cycle-true modeling of an FPGA cir- 
cuit. System Generator simultaneously 
provides access to key features in the FPGA 
fabric, including SRLIGE shift register 
logic, distributed and block memory, and 
embedded multipliers. 


System Generator includes a 
Simulink library of functional 
blocks for building DSP, arith- 
metic, and digital logic cir- 
cuits. These polymorphic 
blocks compute their output 
types based on their inputs, 
although alternatively, you can 
specify their quantized output 
types explicitly. You can com- 
bine Xilinx blocks with MAT- 
LAB and Simulink blocks to 


create a realistic test bench and 


is defined by its impulse response, a length 
of filter 


Nos Pyy.++5 Ay.» LE X 9, X1 X9-+-) IS a Sequence 


N sequence coefficients: 
of input values, where by convention, we 
define x;=0 for 7<0, the filter output 
sequence , 5 V5... is defined by the con- 
volution sum 


N-1 
va hiXy-i 
i=0 


That is, the filter output at time n is com- 
puted by accumulating a sum of products 
of the filter coefficients with the N most 


recent input samples. 


Sep eee ees ee ee eee ee es eee 
ee ee ee ee ee 


LES 0 


eS ed a 
Oe 


to analyze data computed by 
your model. The high level of 
provided by 


abstraction 


ede Abd ge Bae PAD hemes 


ee ee ee oe Pee 
ee ee een) 
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Mapping the Algorithm onto an FPGA 


You can implement an FIR filter in an 
FPGA in many ways. A versatile approach 
that maps well onto an FPGA employs a 
multiply-accumulate (MAC) engine to 
compute the sum of products. As shown 
in Figure 1, a MAC unit is easily con- 
structed in System Generator using the 
multiplier and accumulator blocks. The 
multiplier can be implemented either in 
the logic fabric or, for Virtex-II family 
FPGAs, using dedicated 18-bit x18-bit 
embedded multipliers. System Generator 
ensures the underlying IP core provides 
an efficient implementation. Note 
that upon reset, the accumulator 
reinitializes to its current input 
value rather than zero, to avoid a 


one-clock cycle stall. 
Customizing the Data Path 


Although Simulink provides a 
graphical block editor, System 
Generator should not be mistaken 
for a “schematic capture” tool. 
System Generator models are fully 
customizable and executable in 
Simulink without recompilation. 


(In fact, some blocks can be recon- 



































System Generator greatly sim- 
plifies algorithm development 


and verification. 


In addition to a system-level modeling 
library, System Generator includes a code 
generator that automatically generates a 
synthesizable VHDL netlist from your 
Simulink model. This netlist includes IP 
(intellectual property) blocks that have 
been carefully designed for high perform- 
ance and density in Xilinx FPGAs. System 
Generator also creates project and con- 
straint files to assist implementation using 
the Xilinx Foundation™ ISE 4.21 and 
soon-to-be-released ISE 5.1i tools, as well 


as the major synthesis tools. 
Building an FIR Filter 


As a simple but instructive example, let’s 
consider how System Generator can be 


used to create a parametric finite impulse 


response (FIR) filter. An N-tap FIR filter 
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Figure 1 - Multiply-accumulate engine 


In practice, all numbers must be repre- 
sented with a finite number of bits. With 
a traditional processor, numeric data are 
typically represented as 8, 16, or 32-bit 
integers, or in a floating-point representa- 
tion. In contrast, in an FPGA, we have no 
such word length limitations. We can cre- 
ate a custom data path processor having 
an arithmetic precision tailored to the 
application. System Generator supports 
this capability by providing an arbitrary 
precision fixed-point data type. Each 
block allows us to specify its output pre- 
cision and the policy for handling quanti- 
zation and overflow. We can model the 
system in the Simulink environment 
under a number of scenarios, and analyze 
the data to ensure exactly the right preci- 


sion for the application. 


figured during simulation.) Xilinx 
blocks support Simulink’s data type 
propagation capability, and provide 
extensive error checking on their 
parameters and usage. Because System 
Generator is seamlessly integrated with the 
Simulink tool suite, you can customize 
Xilinx blocks in ways that are impossible in 


schematic and other visual tools. 


For example, in our System Generator 
model we can specify the arithmetic pre- 
cision of the blocks in the data path using 
MATLAB expressions, making it possible 
to minimize the hardware used, and still 
avoid the possibility of overflow. For a fil- 
ter with k-bit coefficients and m-bit 
input, we know that the output 


N-1 N-1 
all =o el = OM 
i=0 i=0 


N- N-1 
Se = 2s 
i=0 i=0 
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so the accumulator requires no more than 
m+ litters DPM bits. This is in 
practice considerably fewer than the 
m+k+ ies n| bits implied by the input 
and coefficient precision. The desired 
accumulator width is, of course, readily 
expressed in the MATLAB language. (In 
Figure 1, the binary point in the fixed- 
point data is also taken into account.) 
When you know the input values have 
limited dynamic range, it may be possible 
to further tighten the bond. The accumu- 
lator width we just derived is a function of 
the input precision and the 
MATLAB array that stores the 
filter coefficients, so if you 
want to change the input pre- 
cision or change the filter 
itself, the same model will 
“right-size” the data path 
without any modification. 
The implications of this abili- 
ty to use the MATLAB inter- 
preter to customize your 
System Generator model 
should not be underestimated 
— it is unrivaled by any other 


design flow. 


Completing the FIR Filter 
Data Path 


As shown in Figure 2, the input data buffer 
is implemented as an SRL16E-based 
addressable shift register, and the filter 
coefficient buffer is implemented as a block 
memory. (Both are supplied as handcrafted 
Xilinx LogiCORE™ algorithms.) By stor- 
ing the filter coefficients in reverse order in 


memory, the same address counter can be 


used to drive both buffers. 


Because there are N multiply-accumulate 
operations per input sample, the filter must 
run internally at N times the data rate to 
supply a continuous data stream. The cap- 
ture register on the output of the MAC is 
used to latch the accumulated sum of prod- 
ucts, and its output is down-sampled by N 
to match the input data rate. This simple 
filter architecture is quite compact and effi- 


cient. A 64-tap non-symmetric filter with 
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12-bit coefficients and data requires only 
110 slices in a Virtex-I] XC2V250-6 
FPGA, and it runs at 195 MHz, or 3 Msps 
(ISE 4.21, production speeds files 1.96). 


System Generator provides many ways to 
tailor the implementation of a design, and 
changes are tracked automatically and 
transparently. In the FIR filter, you might 
choose to use dedicated embedded multi- 
pliers for the MAC engine, coupled with a 
block memory (BRAM) for the coefficient 
data. (It is natural to do so, because these 


resources are juxtaposed in the FPGA — 


a 





Figure 2 - A multiply-accumulate (MAC)-based FIR filter 


although both can also be implemented 
efficiently in the logic fabric.) Filter 
throughput can be increased significantly 
by employing additional MAC engines. 
System Generator makes this a straight- 
forward extension of our example. When 
you switch the implementation strategy, 
latencies are automatically adjusted as 
necessary in the System Generator model 


to match the hardware behavior. 
Other Implementation Options 


Of course, you do not have to build filters 
from scratch. There is an FIR filter block 
in the System Generator library that 
employs distributed arithmetic (DA) to 
map the computation into the FPGA. 
This is often the best way to implement a 


filter, because the underlying IP core sup- 


ports many efficient architectures, includ- 
ing fully parallel DA, fully serial DA, half 
band, polyphase interpolators and deci- 


mators, and more. 


At the same time, there are occasions when 
you will want to design a custom filter, for 
example, to exploit symmetries, take 
advantage of the embedded Virtex-II mul- 
tipliers, or to build adaptive filters. While 
providing a high-level simulation and 
modeling capability, System Generator 
also allows you to apply your knowledge of 
the FPGA to map your algorithm onto 
specific resources. As we saw in 
our example, you can often para- 
meterize the design using MAT- 
LAB functions, so that changing 
parameters automatically gives 
you an appropriately customized 


implementation. 
Conclusion 


With Virtex/Virtex-II family 
FPGAs, handcrafted IP 
LogiCORE algorithms, and 
System Generator software, 
Xilinx is rapidly changing the 
way people think about DSP. 
Designing a custom DSP data 
path processor in an FPGA has 


never been easier. 


This article barely scratches the surface of the 
capabilities of the Xilinx System Generator. 
The System Generator 2.2 release includes a 
number of useful annotated demonstration 
designs, including a QAM demodulator, 
concatenated codec for digital video broad- 
cast, discrete wavelet transform, and several 
extensions of the FIR filter discussed in this 
article, to name but a few. These demonstra- 
tion designs can be used as-is, or provide a 
starting point for you to create your own 
applications. A full-featured, free, 90-day 
evaluation version of System Generator 2.2 is 
available for download from the Xilinx web- 


site at www.xilinx.com/systemgenerator_dsp. 


For more information on MATLAB 
and Simulink software, visit 


www.mathworks.com. & 
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Use Virtex FPGAs and Xilinx Software Tools to 
Reduce Line Echoes in PSTN and Packet Networks 


By using Xilinx software and Virtex FPGAs with DSP capabilities, elntochips Ltd. brought its line-echo canceller 
fo market faster and better than a traditional DSP chip solution. 


by Nilesh Ranpura 
Project Manager, ASIC Group 
alnfochips Itd 
nilesh@einfochips.com 


Line echoes occur in public switched tele- 
phone networks (PSTN) at so-called 
“hybrid” points, where 2-wire and 4-wire 
copper lines are interconnected at the cen- 
tral office. Due to electric current leakage 
in the hybrid, a part of the signal energy is 
reflected back to the source 
of the signal, which causes 
an echo on the phone line. 
Likewise, in packet net- 
works, the delays associated 
with processing the voice 
stream can also produce an 
echo on the line. 


At elnfochips Ltd., we have 
developed an echo canceller 


using Virtex!M FPGAs. We 


chose Virtex devices because 





they give us high-speed per- 
formance and an edge in 
area-to-cost ratio over tradi- 


tional ASIC solutions. rar 
Bus 
Echo cancellers are advanced 


signal processing algorithms 

running on DSPs or fixed- 

function ASICs that remove 

line echoes in voice net- 

works. An adaptive FIR fil- 

ter is used to predict the echo from the his- 
tory of the transmitted signal. 


Echo cancellers can be integrated into a 
phone system or packet network along 
with other voice-processing functions, such 
as subtracter, double-talk detector, nonlin- 
ear processor, narrow band signal detector, 
PCM encoder/decoder, offset null filter, 
and controller/processor interface, as 
shown in Figure 1. 
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Signal Input Block 





Rin Null 


External Memory 
Interface 





Interface 





Virtex FPGAs: The Obvious Choice 


We started our project with the goal of cre- 
ating a single-channel echo canceller capa- 
ble of handling an echo tail length of 128 
ms. We had the option of using a DSP 
chip for algorithm implementation, but 
during the course of development, we dis- 
covered we could deploy a Virtex FPGA- 
based solution with DSP capabilities to 
achieve echo cancellation of up to 128 ms. 


Offset NLP + 
Null CNG 









ADP 


FIR 
Filter + Controller 
Double Talk 
Detector 


Control Register 


Main 
Controller 


Figure 1 - Echo canceller logic block diagram 





The choice was obvious. Not only do 
Virtex FPGAs have high-speed digital sig- 
nal processing functions, but they also 
include extensive flexibility in using block 


RAM and LUTs as RAM and ROM. 


We implemented the following major 
blocks as a part of the data path design in 
the FPGA: 


e Adaptive FIR filter for estimating the 


echo signal 


Signal Input Block 







Controller g 


Detection 


¢ Programmable double-talk detection 


threshold 


¢ Nonlinear processor with adaptive sup- 
pression threshold for removing the resid- 
ual echo signals 


¢ Offset null filtering of PCM channels 


e Narrow band signal detector conforming 
to ITU-T G.165 requirements for 
preventing the divergence of the adaptive 

filter coefficients 


Serial Data 
Output ¢ Selectable p/A law com- 


sour Panding, PCM coding, 
and sign magnitude. 
The core was verified 
using Industry Standard 
Vectors compliant with 


the G.165 standard. 


Rout 


Conclusion 


Xilinx Virtex FPGAs sup- 
port high density and 
speed with extensive EDA 


tools support on a wide 


NBSD + 


Tone spectrum of features. The 
combined solution of 
Virtex hardware (in this 
case, the Virtex XCV800 
FPGA), associated Xilinx 
Foundation™ Series soft- 
ware, and the Xilinx 
ChipScope™ integrated 
logic analysis tool saved us invaluable time 
to market. 


As a result, of implementing a total Xilinx 
Virtex solution, we were able to be among 
the first to bring our echo canceller solu- 
tion into the emerging areas of Internet 
gateway, AIM gateway, XDSL technology, 
cable modem, and wireless telephony. For 
more information visit www.einfochips.com 
or write info@einfochips.com. &% 
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SystemlQ Solution 





by Pegay Abusaidi 


Product Marketing Manager 

Xilinx, Inc. 

peggy.abusaidi@xilinx.com 

Traditional system interfaces, such 
as the existing PCI and VME 


parallel bus schemes, cannot keep 
up with today’s ever-growing bandwidth 
requirements. Therefore, RapidlIO™, 
HyperTransport™, InfiniBand™, PCI 
Express™ (aka 3GIO), and other high- 
speed bus interconnect technologies have 
been developed to open the I/O bottle- 
neck. Figure 1 illustrates the variety of 
Internet applications that drive the need 


for more bandwidth. 


How do you choose the right I/O interface 
standards for your systems now — and how 
do you keep current with the evolving I/O 


standards in the future? Moreover, how do 
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Demands 


system interconnectivity. 


Expands with Bandwidth 


The Xilinx Plattorm FPGA SystemlO solutions solve 
the emerging interface challenges for high-speed 





you meet your time-to-market and cost 
goals with minimal risk when the interface 


standards keep changing? 


As shown in Figure 2, Xilinx provides a 
comprehensive list of Platform FPGA 
SystemIO Solutions to help you solve the 
I/O bottleneck problem. SystemIO gives 
you the ability to implement designs using 
any of the latest I/O standards. By using a 
Xilinx Platform FPGA with a SystemIO 
solution, you can accelerate your time to 
market and be assured that your designs will 
remain current even as I/O standards evolve 
and specifications change. In addition, 
SystemIO solutions enable you to “future- 
proof” your products by allowing to you 


update the code in the FPGA to bring your 


product up to compliance — even after the 


product has been deployed in the field. 


Solving the Bandwidth Problem 


Today, most computer, embedded _ pro- 
cessing, and telecommunications equip- 
ment are parallel bus-oriented. However, 
as device performance increases, and 
demand for bandwidth rises, these multi- 
drop bus structures are reaching their per- 
formance limits. In most cases, these buses 
can only process one module at a time. 
This results in idle time for the subsystems 
waiting for bus access, thus decreasing 


overall system performance. 
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The industry’s response has been 
the development of a host of 


new interconnection schemes, 





some with data transfer rates 






exceeding 10 Gbps. The new Consumers 

proposed standards have backers = Need... ———_» 
like Intel, AMD, Microsoft, mennetp, an 
Hewlett-Packard (Compaq), & 

Dell, Xilinx, and Sun “= 


Microsystems. In addition to the 
aforementioned RapidIO, PCI 
Express, HyperTransport, and 
InfiniBand technologies, other 
standards already in use, or 
evolving toward higher levels of 
performance, include 10-gigabit 
attachment unit interface 
(XAUI), POS-PHY Level 4, 
Gigabit Ethernet, and various 
optical ATM formats like OC- 
12, OC-48, and OC-192. These new or 
evolving standards cover all architectural 
environments, such as motherboards in 
chip-to-chip communication; backplanes 
for communications between subsystems; 
and storage, local, and wide area networks 


(SANs, LANs, and WANs). 


Parallel bus standards are divided into two 


general categories: 


¢ System-Synchronous Parallel — the vener- 
able PCI family of buses, including PCI- 
X and Compact PCI. 


¢ Source-Synchronous Parallel — RapidIO, 
HyperTransport, Flexbus 4, POS-PHY 
Level 3/4, and other protocols. 


Because these standards are new and subject 
to change, FPGAs are the ideal way to ship 
a product without fear of obsolescence. 
Plus, the ability to use Xilinx LogiCORE™ 
intellectual property (IP) cores that imple- 
ment these complex and timing-critical 
buses gives you a fast, risk-free method to 


address a rapidly changing market. 


Clearly, the challenges you face are greater 
than ever. Not only are product life cycles 
shorter, simple evolutionary product steps 
are no longer sufficient with the multitude 
of standards hitting the market. When you 
consider that most of these standards are 


not final, the challenge to get your product 
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Figure 1- Internet bandwidth trend 


right on the first iteration is daunting. 
That's why the ability to rapidly change 
your design to meet changes in the stan- 
dards and markets is incredibly valuable. 
Further, the ability to leverage your design 
efforts with pre-engineered, pre-verified, 
and supported LogiCORE IP cores allows 
you to create more functionality in less 


time than ever before. 


The SystemlO Solution 
The Xilinx Platform FPGA SystemIO 


Solution uses the unique Virtex™-II] 


Protocol Eniyite 
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SelectI/O™-Ultra blocks to pro- 
vide the fastest and most flexible 
electrical interfaces available. 
Each user I/O pin is individually 
programmable for 19 single- 
ended I/O standards or six differ- 
ential I/O standards, including 
LVDS, SSTL, HSTL, and GTL+. 
The SystemIO solution is capable 
of delivering 840 Mbps double 
data rate (DDR) and 644 MHz 
single data rate (SDR) LVDS per- 
formance. Furthermore, any two 
I/O pins can be used as a differ- 
ential pair, providing maximum 
board layout flexibility and 
reducing overall system cost by 
saving both component and 


board layer costs. 


Using Virtex-I] 840 Mbps LVDS perform- 
ance and an abundance of memory and 
logic resources, you can easily create 
designs using 1GE (Gigabit Ethernet) and 
10OGE MAC, PCI and PCI-X, POS-PHY 
Levels 3 and 4, RapidIO, HyperTransport, 
and Flexbus 4 protocols. Free reference 
designs for implementing interfaces, such 
as SFI-4, XSBI, XGMII, and CSIX are 
available from the Xilinx website. Table 1 
shows the Platform FPGA SystemIO 
Solutions Summary, including both IP 


cores and reference designs. 


Payical metas 





Figure 2 - Xilinx Platform FPGA SystemIO Solutions 
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With the variety of interface reference designs 
and cores available in the SystemIO solution, 
you can easily customize your application. 
Figure 3 shows a 10 Gigabit Ethernet 
LAN/WAN line card example in which sever- 
al Platform FPGA SystemIO solutions are 
used to provide seamless interfaces to external 
PHYs and network processors. All of these 
interfaces are pre-engineered by Xilinx for 
easy drop-in functionality, enabling you to 


reduce your design cycle time. 


With our recent introduction of the Virtex- 
II Pro™ Platform FPGA, we have incorpo- 
rated Mindspeed’s SkyRail™ }architecture 
to embed up to 16 Rocket I/O™ 3.125 
Gbps multi-gigabit transceivers (MGTs), 
the highest number of MGTs in a single 
programmable device. This breakthrough 
technology supports all the popular emerg- 
ing serial I/O standards, including the PCI 
Express, InfiniBand, XAUI, and Fibre 


Channel protocols. 


The combination of SkyRail architecture 
and Selectl/O-Ultra technology supplies a 
comprehensive list of popular parallel and 
serial interface IP cores in the overall 
SystemIO solution. Additionally, both 1GE 
and 1OGE MAC IP cores are available with 


both parallel and serial interface options. 
Conclusion 


Platform FPGA SystemIO solutions enable 
Xilinx to provide you with the ultimate 
connectivity platform to connect and 
bridge interface requirements for your next- 


generation data communication systems. 


A terabit system can simultaneously contain 
multiple interface requirements, including 
both emerging serial I/O protocols, as well as 
established parallel I/O standards. The new 
Virtex-II Pro Platform FPGA is ideally suit- 
ed to help you address interface challenges 
by providing Rocket I/O MGTs for serial 
connectivity and SelectI/O-Ultra technology 


for parallel connectivity. 


Combining LogiCORE IP cores for proto- 
cols and data processing, along with design 
tools and third-party partnerships, Xilinx 
provides the complete system connectivity 
solution for today’s — and tomorrow's — 


high speed designs. %& 
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POS-PHY L3 


Standard Compliance Aggregate Bandwidth Performance = Select!/O Bus Availability 


OIF-SPI3-01.0 2.48 Ghps 32b 3.3V CMOS 
Saturn POS-PHY L3 
OIF-SP14-02.0 12.8 Gbps 800 Mbps per pair 16b LVDS 
Saturn POS-PHY L4 400 MHz DDR 
OIF-SP14-01.0 12.8 Gbps 64b HSTL 
AMCC Flexbus 4 


POS-PHY L4 


Flexbus 4 


64b HSTL 
(Flexbus 4 side) 


Flexbus 4 to OlF-SP14-01.0 
POS-PHY L4 AMCC Flexbus 4 
Bridge OlF-SP14-02.0 


200 MHz 
(Flexbus 4 side) 
350 MHz DDR 16b LVDS 
Saturn POS-PHY 14 (PL4 side) (PL4 side) 


1GE MAC w/ GMI IEEE 802.3-2000 1 Gbps 125 MHz 8b 3.3V GMIl 
Rocket 1/0 


1GE MAC IEEE 802.3-2000 1.25 Gbps 1.25 Gbps 
w/ PCS-PMA 
Transceiver 
156.25 MHz DDR | 32b XGMII HSTL 


3.125 Ghps 
per channel 


1 channel of 


10GE MAC 
w/ XGMIl 


Designed to 
IEEE P802.3ae 
draft 4.1 


Designed to 
IEEE P802.3ae 
draft 4.1 


10GE MAC 
w/ XAUI 


4 channels of 
3.125 Ghps 
Rocket 1/0 
Transceivers 


8b LVDS 


12.5 Gbps 


RapidlO PHY =| RapidlO Interconnect 8 Ghps 500 Mbps per pair 
Specification v1.1 250 MHz DDR 
PCl64 PCI Spec V2.3 264 -528 MBps 33/66 MHz 64b 5/3.3V PC 
PCI32 PCI Spec V2.3 132 -264 MBps 33/66 MHz 32b 5/3.3V PCI 


PCI-X 64/100 PCI-X Spec V1.00 800 MBps 66/100 MHz 64b 3.3V PCI-X 


HyperTransport | HyperTransport V1.010 6.4 Ghps 800 Mbps per pair 8b HT 1/0 
Single-Ended 400 MHz DDR 
Slave 
(SIX Reference (SIX-LI 6.4 Ghps 
Design 


XGMII Designed to 
Reference Design IEEE P802.3ae 
drat 4.1 


Q3 ‘02 


32b HSTL 


156.25 MHz DDR | 32b XGMII HSTL 


Table 1 - Summary of Platform FPGA SystemIO solutions 




























XGMII POS-PHYL4 POS-PHYL 
SURLUALLRLN @ 
c ra t z —— Fc 
ASSP ASSP | in | ASSP 7 * 
Integrated 10GE 4 < 4 “ 
: Optics PHY x H »  NPU Z = : S 
OC-192 POS LAN/WAN : A Z —— Oo 


| 
Bridge to 
Control Plane 


PCI 64/66, PCI-X 66 & 100, RapidlO, HyperTransport* 


PCI 64/66, PCI-X 66 & 100, RapidlO, HyperTransport* 
* Available in Q3/02 


Figure 3 - 10 Gigabit Ethernet 
LAN/WAN line card application example 
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MicroBlaze Development Kits trom Memec Design 
demonstrate the versatility of the MicroBlaze 
soft processor core in a variety of Xilinx FPGAs. 


by Jim Beneke 
Director ot Technical Marketing 
Insight Electronics 
jim_beneke@ins.memec.com 


125 MHz, the Xilinx 


MicroBlaze™ 32-bit soft processor core is 


Running at 


the industry’s fastest soft processing solu- 
tion. The MicroBlaze processor delivers a 
true 32-bit processor, critical for building 
complex systems for the networking, 


telecommunication, data communication, 
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embedded, and consumer 
markets. The soft processor fea- 
tures RISC architecture with 
Harvard-style separate 32-bit 
instruction and data busses, which run at 
full speed to execute programs and access 
data from both on-chip and external mem- 
ory. With 900 logic cells and 82 D-MIPS, 
the MicroBlaze processor meets the utiliza- 
tion, performance, and cost targets that 


most FPGA designers require. 


To support the development of 
MicroBlaze-based applications, Memec 
Design has introduced three different 


MicroBlaze Development Kits (MDKs) to 





accelerate design, prototype, and evalua- 
tion cycles. The three MDKs allow you to 
quickly prototype your MicroBlaze proces- 
sor and peripherals in real-world hardware 
environments. With functioning hardware 
platforms based on Xilinx FPGAs, you can 
easily verify design concepts, system inter- 


faces, and real-time functionality. 
Three Families — Three Kits 


The Virtex™-II, Spartan™-IIE, and 
Spartan-II FPGA families are the logical 
MicroBlaze 
Therefore, Memec offers specially designed 


targets for applications. 
standalone system boards for a device from 


each of these families. 


Additionally, each MDK offers an expansion 
module that contains common processor 
peripheral interfaces and memory, a pro- 
totype module for creating custom circuits, 
the Xilinx MicroBlaze processor license, 


software tools, and a power supply. 


¢ The Virtex-II platform is based on the one 
million-gate XC2V1000 device (Figure 1). 
The system board includes 2M x 16 DDR 
SDRAM and a 16-bit LVDS Tx/Rx port. 


¢The Spartan-IIE system board is based 
on the 300K-gate XC2S300E device 
(Figure 2). This board offers a 10-bit LVDS 
Tx/Rx port and 82 general purpose I/Os. 


¢ The Spartan-II kit uses the 200K-gate 
XC2S8200 device (Figure 3), which resides 
on a 32-bit PCI interface, along with 2M 
x 32 SDRAM. The PCI interface presents 
an interesting configuration by allowing 


the MicroBlaze processor to sit on the 


backend of the PCI bus. 


All three platforms include the P160 expan- 
sion slot, an ISP PROM, on-board power 


regulation, and other user support circuits. 


Summer 2002 








1/0 Connector 


1/0 Connector 





Figure 1 - Virtex-II system board block diagram 


P160 Expansion 
The P160 expansion slot provides the flex- 


ibility you need in a processor and IP devel- 
opment environment. The expansion slot 
allows you to easily plug custom peripheral 
modules into the system board and inter- 


face to the FPGA device. 


The P160 slot supplies 110 user-defined 
I/O signals from the FPGA to the user 
application circuit on the expansion card. 
By adding a P160 module, you can tailor 
your development environment and ease 


hardware verification. 





Figure 4 - P160 communications 
module block diagram 





Figure 5 - P160 prototype module block diagram 
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Figure 2 - Spartan-IIE system board block diagram 


Two P160 modules are included in the 
MDKs: 


¢The P160 communication module 
(Figure 4) provides interfaces for 10/100 
Ethernet, RS-232, USB, PS/2 keyboard, 
LCD module, I°C, and SPI connec- 
tions. The communication module also 
includes 2M x 32 flash and 256K x 32 
SRAM for off-chip memory expansion 
by way of the IBM CoreConnect™ 


on-chip peripheral bus architecture. 


¢The P160 prototype module (Figure 5) 
offers general purpose expansion headers 
for all 110 user I/O signals, as well as a 


prototype area for building custom circuits. 
Conclusion 


As MicroBlaze implementations move into 
the mainstream, you will need easy to use, 
flexible, low-cost hardware platforms to 
verify and validate your design concepts. 
Supporting Virtex-H, Spartan-IIE, and 
Spartan-II devices, each Memec Design 
MDK bundles the essential tools you need 


to explore MicroBlaze-based designs. 


The Virtex-I] MDK is priced at $795. The 
Spartan-I[E and Spartan-I] MDKs are avail- 
able for $695. Call 888-488-4133, ext. 235, 
or go to www.insight-electronics.com/ 
microblaze for more information or to order 


a MicroBlaze Development Kit. % 
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Figure 3 - Spartan-II system board block diagram 


MemecCore Group 
and Memec Design 


In addition to the MicroBlaze 
Development Kit, Insight Electronics 
and Impact Technologies offer 
MicroBlaze design service support 
through Memec Design. 

Our Xilinx-dedicated design services 
group can assist you in MicroBlaze and 
turnkey FPGA design, CoreConnect 
peripheral design, customization, 
integration, and FPGA performance 
optimization. Furthermore, the 
MemecCore Group focuses exclusively 
on offering Memec customers the 
highest quality intellectual property for 
tomorrow's designs. MemecCore offers 
several CoreConnect-enabled peripheral 
functions for easy integration into 
MicroBlaze applications. 

Visit www.insight-electronics.com 
or www.impact.eu.memec.com 
for more information on our 


complete MicroBlaze offerings. 
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Xyron ZOTS IP Core Uses 
Virtex-I| FPGA to Demonstrate 
Ns Brea kthrough sss 


technology gives a 40 MHz 
, _ FPGA.the realtime video 
 _ rendering power of a GH 


) .. desktop microprocessor” 
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by William Dress 

Director ot Strategic Development 
Xyron Semiconductor 
wdress@xyronsemi.com 


Xyron Semiconductor's patented “zero over- 
head task switch” (ZOTS™) technology 
promises to revolutionize both hard- 
ware and software design for embed- 
ded systems applications. Essentially, 
we remove the task-switching and 
interrupt control decision-making 
off the core microprocessor — which 
dramatically improves microproces- 
sor efficiency and provides end 
results similar to increasing a micro- 
processor's clock speed by several 


orders of magnitude. 


Our technological breakthrough is 
novel and radical. Never before has 
RTOS task management been imple- 
mented in hardware in such a manner. 
We knew we could attract customers 
if we could just effectively demon- 
strate the technology. Fortunately, the 
functionality of the Xilinx Virtex?-II 
FPGA gave us the solution. 


Show and Tell 


We chose to demonstrate our tech- 
nology by building on a public 
domain 32-bit RISC architecture. By 
introducing a simple, seven-task 
simultaneous environment contain- 
ing real-time video, stored video, and 
real-time audio, we have been able to 
demonstrate a fully functional multi- 
media microprocessor that handles 
interrupts on a pixel-by-pixel basis. 
With Xyronium™ _ technology 
embedded into a 32-bit RISC 
processor as a firm IP core in a Xilinx 
Virtex-II XC2V1000 FPGA, our 
demonstration board runs at only 40 MHz. 
Nevertheless, the board processes full- 
motion, full-frame video and stereo CD- 
quality audio — with enough performance 
power remaining to manipulate video in 
real time in a moving picture-in-picture 
synchronized with the audio amplitude 
(Figure 1). To see an _ interactive 
Macromedia® Flash™ demonstration, go 


to www.xyronsemi.com/xdemo. shtml. 
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A similarly burdened standalone desktop 
microprocessor programmed with the same 
RTOS capability and without video, audio, 
or memory support architecture would 
need to run at over 1.0 GHz to accomplish 


the same real-time tasks. 





Figure 2 - Xyron Semiconductor Edison development board 
incorporates a Xilinx Virtex-II XC2V1000 FPGA. 


Show and Sell 
The Xyron ZOTS approach is so revolu- 


tionary that we anticipated the initial mar- 
ket acceptance might be difficult. Potential 
customers might have difficulty believing 
you can implement traditional RTOS soft- 
ware task management functions in hard- 
ware — and realize significant microproces- 


sor performance improvements as a result. 


Therefore, we created a powerful stand- 
alone, single-board computer that show- 
cased our technology so users could see for 
themselves (Figure 2). Multiple video, 
audio, and user interfaces allow our cus- 
tomers the flexibility to design for any 

application in a familiar Xilinx 


FPGA environment. 


Using a Virtex-IIT XC2V1000 FPGA, 
the development board is an ideal envi- 
ronment to not only illustrate the 
power of the ZOTS innovation, but 
also to implement these powerful new 
technologies in real-world applications. 
Once the Xyron ZOTS firm IP micro- 
processor core is inserted into the fab- 
ric of a Virtex-II device, there is ample 
room to create custom circuitry to sup- 


port specific design requirements. 
Conclusion 


The path from demonstrating a radical 
new microprocessor technology to pro- 
viding a practical development environ- 
ment was optimized with the Virtex-II 
product family. The process only took a 


few months with a small design team. 


In all fairness, we explored other 
options, but they were not nearly as 
efficient. Without the functionality of 
the Virtex-II FPGA, it would not have 
been possible to construct such a robust 


development board as the Edison™ 






Development l "7 
Board. That's a Za 
why Xyron Cc — 


Semiconductor has become a partner in 


the Xilinx AllianceCORE™ program. 


In a classic win-win situation, com- 
bining Xyron technology with Xilinx 
products has given us the ability to 
realize standard-cell, dedicated IC 
performance in a flexible FPGA environ- 
ment. The Virtex-II product offering has 
enabled Xyron Semiconductor to take its 
first steps toward becoming a full-fledged 


microprocessor company. 


For more information on Xyron technolo- 
gy, please visit www.xyronsemi.com. To pur- 
chase a development board, go to 


www.xyronsemi.net. 
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by Art Stryer 

Senior Field Applications Engineer 
Elantec Semiconductor, ‘Ine 
astryer@elantec.com 


Virtex-II,Pro™ series Platform FPGAs use 
separate supply voltages for the core circuitry 
and I/O interface power supplies. Typically, 
the I/O interface requires 2.5V and the core 
circuitry requires 1.8V. As a designer, you 
need to minimize board spacevand maximize 
the reliability of the power supplies. 


Elantec Semiconductor Inc. has highly inte- 
grated solutions to fulfill your local power 
needs. Elantec’s family of integrated FET 
DC:DC converters provides optimal solu- 
tions for Virtex-II Pro designs. These power 
products are unique synchronous buck con- 
verters with integrated FETs and internal 
current sensing. These features, as well as 
other embedded functions, enable high effi- 
ciency and higher frequencies, which lead to 
smaller inductors. These features and func- 
tions require fewer external components, 
giving you. the advantage of minimum board 
space. Additionally, higher reliability is 


achieved through lower die temperature. 
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Integrated FET DC:DC Converters 


When you are designing a local power sup- 
ply for a Virtex-II Pro system, several factors 
allow you to achieve a small PC board area. 
Elantec power products include the follow- 
ing features: 


¢ Synchronous DC:DC switching regula- 


tors running at higher frequencies 
— Up to 1 MHz switching frequency 


¢ Higher frequencies, allowing use of small- 
er external capacitors and inductors 


— Steady peak-to-peak ripple voltage with 


fewer external parts 


¢ Higher integration, resulting in fewer 
external components 


— No external FETs 
¢ Reduced parts count. 
Embedded Functions Reduce ICs 


When designing a local power supply for a 
Virtex-II Pro system, you need certain fea- 
tures for a complete solution. To achieve 
these complete solutions usually requires 
additional ICs. Elantec parts, however, fea- 
ture embedded functions, including: 


¢ Current mode control 


— On-chip current sensing feedback for 
internal PWM controller 


¢ Over-current protection 
— Cycle-by-cycle current sensing and 
limiting 
¢ Over-temperature protection 


— Junction temperature monitor circuit 
on die 


—Control circuits with hysteresis for 
automatic supply restart 


¢ Adjustable switching frequency 


— User-selectable frequency to avoid 
switching interference 


e Soft start 
— Internal power-up control 
¢ Power sequencing 


— Sequential activation of multiple power 
supplies without additional control ICs 


e Simultaneous power tracking with con- 
trolled voltage out. 


Summer 2002 


High Reliability Components 


The following list of functions in Elantec 
power products improve the reliability of 


your power supply: 
¢ Synchronous converter 


— Both upper and lower power switches 


on-chip 
— Up to 95% efficiency 
¢ Integrated Power FETs 


— Reduced power dissipation versus exter- 


nal FETs with resistive connections 
¢ Advanced packages 


— Optional low die temperature, small 


HTSSOP. 
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Figure I - Integrated FET 4A and 8A 
DC:DC converter packages 


Part No. Type Supply Voltage 
EL7558BC 
EL7556BC 
EL7564C 
EL7563( 
EL7562C 
EL7551€ 


EL7512€ 


Output Voltage 


fk [estes [ea | 
rk [esressr | area | 
ris [esiess | areas | 
rik | awesa | areas | 
rik | esiess [area | 

ae 


4.5 to 5.5V 1.0V to 3.8V 
2.0V to 14V 5.0V to 16V 0.2 to 0.5A 


Table 1 - Elantec power products 
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Complete DC:DC Converter Family 


Different designs will have varying 
requirements for voltage and current, 
switching frequency, and available board 
space. The Elantec family of DC:DC 
products meets Virtex-II Pro local system 


needs by providing: 
¢ Continuous output currents 


— Products to choose from with currents 


up to 1A, 2A, 4A, GA, and 8A 
¢ Flexible output voltages 


— Adjustable voltages from 1.0V to 3.3V 


available from each product 
¢ Multiple package types 


— HTSSOP, QSOP, MSOP, HSOP, and 
SOL available. 


Table 1 shows a comprehensive summary 
of Elantec power products. Figure 1 shows 
packages and board area space for 4A and 
8A products. 


Conclusion 


Design engineers engaged in Virtex-II Pro 
board designs need to have power supplies 
that provide a complete solution for local 
power in their systems. Elantec DC:DC 
converters have all of the features and 
embedded functions to get the job done. 
For data sheets, application briefs, and 
additional information, visit Elantec online 
at www.elantec.com, or send an e-mail to 
tech@elantec.com. ¥& 


Package 
HSOP28 
$028 
HTSSOP28 
HTSSOP28 
QSOP16 
QSOP16 
MSOP10 


Output Current 
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Typical data handling and processing 


@ 
systems require a combination of high- 
() UJ ( [ fe AC UJ (t performance capabilities. Until now, these 
capabilities could only be achieved through 


combining various programmable logic 


Ntaitnrevenks seconc-generation architectures and processor technologies to 


. handle complex control algorithms. 
¢ QS ne modular DICH architecture The release of the Xilinx Virtex-II Pro™ 


has been designed 10 maximize Platform FPGAs, however, now gives us a 
: single, integrated package for manipulatin 
Virtex-lI Pro Platform TPGA complex eee seer cis As on 7 

: four IBM® PowerPC™ 405 processors can 
performance me bandwidth. be embedded deeply in the Virtex-II Pro 


programmable logic fabric. 


In order to create final systems based on 
Virtex-II Pro devices — and to harness maxi- 
mum power from these devices — appropri- 
ate hardware architecture is critical. The 
DIME™ and DIME-II™ architectures are 
open modular standards from Nallatech that 


create the framework for scalable systems. A 


; % small, dedicated FPGA-based system can be 

-" easily scaled to create high-end high-speed 

, a | , DSP applications through the DIME-II 
—, _ 

il > a plug-and-play capability. This architecture 


provides support for processor integration 


within FPGA-based systems, including the 


Rocket I/O 
8 x 3.125 Gbit 


DDR DDR 
SDRAM SDRAM 


DIME-II DIME-II 
VAT a (=). | ace) 


External 


Rocket I/O 
8 x 3.125 Gbit 


Figure I - BenPRO block diagram 
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use of ICE (in-circuit emulation) debug- 


gers, remote booting, and configuration. 


BenPRO Module 
The BenPRO™ module from Nallatech 


provides maximum scalability and maxi- 
mum communications bandwidth. The 
BenPRO™ module will be available with 
all Virtex-IT Pro devices that have embedded 
PowerPC processors. In this configuration, a 
single DIME-II module can have up to four 
hard processors and up to four million sys- 
tem gates of logic for computational and 


interfacing tasks. 


By utilizing up to 16 Rocket I/O™ trans- 
ceivers on a single Virtex-II Pro, and stan- 
dard I/O interfaces offered by DIME-II 
technology, you can get an I/O bandwidth 
in the order of 100 gigabits per second 


To support onboard data and program stor- 
age, four high-speed memory banks are built 
into the BenPRO module. As shown in 
Figure 1, DDR SDRAM and ZBT SRAM 
enable suitable memory architectures to be 
designed for tight integration with the algo- 


rithm or platform architecture required. 
Controlling Virtex-Il Pro Systems 
Nallatech’s FUSE™ (field upgradeable sys- 


tems environment) software supplies the 
reconfigurable computing operating system 
for FPGA-based systems. The powerful API 
(application programming interface) enables 
rapid development of systems and applica- 
tions based on FPGAs, either with or with- 
out, embedded processors. FUSE system 
software is portable and supports multiple 
operating systems, such as Linux, 


VXWorks®, and Windows® systems. 


Additionally, FUSE software supports a 
wide range of languages, including 
C/C++, Java™, MATLAB™ and 
DIMEscript™, Nallatech’s own dedicated 
language for controlling reconfigurable 


computing environments. 


FUSE software also supports multiple 
embedded processors, including the IBM 
PowerPC hard processor and _ the 
MicroBlaze™ soft processor from Xilinx — 
both of which can be embedded in one 
Virtex-II Pro Platform FPGA. The power of 
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FUSE system software provides for 
high-speed booting and data 
communication, as well as sys- 
tem control. The FUSE capa- 
bility allows full integration 
with high-level design 

tools from Xilinx, indus- 
try-standard _ proto- 

cols, and user appli- 
cation software. 
Figure 2 shows 
the _ relation- 
ship among 
FUSE, DIME- 
II, and Virtex-II 


Pro components. 











Figure 2 - How FUSE software 


Distributed Parallel 
Processing 


Although a single Virtex-II Pro device can 
have as many as four processors, DIME-II 
technology can scale systems far beyond this. 
For example, DIME-II interconnect tech- 
nology will enable you to tightly couple four 
BenPRO modules on a BenERA™ cPCI 
card, thus providing 16 PowerPC processors 
on a single card, as illustrated in Figure 3. 
With FUSE system control software, you 
can expand connectivity to multiple cPCI 
cards, leading to a virtually unlimited array 


of processors within the overall system. 


To benefit from this capability, the tools need 
to be in place for handling this distributed 
parallel processing sys- 


control works with Virtex-II Pro logic devices 


Conclusion 


Today, system designers require suitable 
software tools, development platforms, 
and modular hardware to efficiently create 
complex high-performance systems. With 
DIME-II hardware system architecture 
and the FUSE reconfigurable computing 
operating system, you can not only harness 
the full power of Virtex-II Pro Platform 
FPGAs, but you can also use Nallatech’s 
advanced engineering to bundle multiple 
devices within a system — taking system 
designs to unprecedented levels of com- 
plexity and functionality. & 








tem. Nallatech is 
enhancing its system 
design products to 
deliver the tools and 
communications infra- 
structure to support dis- 
tributed parallel pro- 
cessing. Once in place, 
the combined DIME-II 
and Virtex-II Pro tech- 
nologies will enable you 
to create a diverse range 
of system topologies 
that can take advantage 
of the high communi- 


cations bandwidths 
now available — and in 
the future. on BenPRO modules 








BenPRO Ml BenPRO 


Figure 3 - BenERA cPCI card with four Virtex-II Pro FPGAs 
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The system-on-a-chip (SOC) market has 


experienced steady and _ consistent 


growth. Dataquest estimates roughly half 
of all ASIC design starts are SOC based. 
That percentage is projected to reach 
80% by 2005. There are several reasons 
for this clear shift in design methodology. 
Some of the obvious advantages include 
greater Component integration, increased 
speed (Logic <-> Processor), lower 
packaging and test costs, and 
increased overall system reliability. 
All of these combined can potential- 
ly make significant contribu- 
tions to achieving the often elu- 

sive but always important goal of 


accelerated time to market. 


Programmable logic device vendors 

have entered the SOC solution space 
with the introduction of a new class of 
devices known as Platform FPGAs — 
with the most recent addition being the 
Virtex-II Pro™ Platform FPGA. These 
devices offer the same level of integration as 
ASIC SOCs, but in contrast, Platform 
FPGAs facilitate the development of a wide 
range of applications on the same chip. This 
article focuses on seven of the key advan- 


tages of the Platform FPGA approach. 
#1 — Pre-Engineered Platform 


Platform FPGAs integrate several fixed 
and predetermined blocks of hard IP 
(intellectual property) components (sys- 
tem elements) within the programmable 
fabric. Notable among these are high-per- 
formance RISC CPUs, multi-gigabit and 
high speed I/Os, block RAM, system 
clock management, and dedicated DSP 
processing hardware. This powerful 
assembly and harmonious blend of com- 
ponents creates a cohesive system design 
environment. This environment offers 
unprecedented flexibility and perform- 
ance, thus enabling the deployment of a 


wide range of applications. 


The critical technological breakthrough is 
in the ability to tightly interface the vari- 
ous elements into the programmable fab- 


ric. Without this tight integration, much 
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of the speed benefits could not be realized. 
The fact that the choice of system elements 
is already made greatly simplifies the 
design and development process. A fixed 
architecture is particularly beneficial for 
software tool and IP providers in allowing 


them to deliver better value, customiza- 


tion, and architecture-optimized solutions. 





The assembly of the various hybrid IP 
blocks in an ASIC adds substantial com- 
plexity and hardship to the users’ design 
and development environment, because of 
a variety of issues relating to tool and IP 
interoperability, physical layout, timing, 
and system verification. For Platform 
FPGAs, on the other hand, it is much eas- 
ier to tailor and optimize components — 
such as silicon, software, support, and IP — 
because FPGAs represent a fixed and pre- 


engineered target. 


Summary: A fixed, pre-engineered but 
programmable FPGA solution offers a 
more productive and efficient develop- 
ment environment from both the software 


and silicon perspective. 
#2 — Process Technology 


PLD vendors have been able to extract 
great value and benefits from Moore’s Law 
and shrinking device geometries. While 
the majority of ASIC design starts are at or 
higher than 180 nm, FPGAs have raced 
ahead to bring the cost and performance 
advantages of 130 nm to its customers. 
This ability to rapidly migrate to the lead- 


ing edge of technology is essential to deliv- 


ering the performance, capacity, and inte- 
gration that is necessary to challenge and 
displace the current established platforms 


of system design. 


For instance, FPGAs now make it entirely 
feasible to build systems of up to 2M (ASIC) 
gates with CPU(s) running at 400 MHz, 
serial I/O channels at 3.125 Gbps CLB fab- 
ric-switching at 300 MHz, and the entire 
system clocking over 150 MHz. This sur- 
passes the projected sweet spot of ASIC 
SOC designs. 


Looking at it in pure economic terms, 

the costs for a typical mask set for a 
130 nm ASIC run into the $700K+ 
range. That raises the bar for entry into 
the ASIC space, making the Platform 
FPGA an even more attractive option for a 


growing percentage of all SOC designs. 


Summary: FPGA silicon is best-in-class in 
process and engineering, thereby deliver- 


ing best-in-class system performance. 
#3 — Software Tools and Methodology 


It is well known that EDA designer pro- 
ductivity for ASICs is lagging behind 
recent silicon advances. The ability to cre- 
ate a productive design and debug environ- 
ment is absolutely essential to the success of 
any silicon platform. Therefore, software 
plays a critical role in not only making the 
platform easy to use and work with, but 
also in extracting the most performance 


and device utilization out of the silicon. 


The goal is to insulate the user from hav- 
ing to learn extraneous details about the 
platform, yet providing empowerment and 
control when and where it is needed. To 
this end, FPGA vendors have created cus- 
tomized and user-friendly processor sys- 
tem generator tools that aid in instantia- 
tion, initialization, and configuration of all 
the various system component blocks. In 
addition, these software tools automate 
otherwise manual and error-prone tasks, 
such as the interconnections among the 


processor, its peripherals, and buses. 


Design entry is engineered to tightly cou- 
ple the HW/SW domains. Such engineer- 
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Figure I - Software makes use of idle processor time. 


ing leads to not only simplified and easy- 
to-use design flows, but it ensures reliabil- 
ity and robustness from the very start by 
performing design rule and data consisten- 


cy checks across the two domains. 


Two examples of the advantages of cross- 


domain HW/SW co-design include: 


e Automatic generation of device drivers 
and header files for SW engineers once 
a particular block is instantiated by the 
HW designer 


¢ Tools to automatically populate SW 
binary code into appropriate FPGA 


memory bitstreams. 
Summary: Software sells silicon. 
#4 — Advanced Debug 


The importance of finding problems early 
cannot be overstated. Yet, up-front ASIC 
verification is extremely designer- and 


computer-resource intensive. Compared 
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to systems on a board, SOC limits visibil- 
ity into the internal nodes of the system, 
making the task of verification and debug 


more challenging than ever. 


The critical part includes the verification 
of complex interactions between the 
application software and the custom- 
designed peripheral hardware. ‘Traditional 
HDL-centric verification and debug tech- 
niques can no longer deal with the rising 


complexity of system designs. 


Consider a typical application, such as 
MPEG A/V decoding, where a large 
number of simulation cycles are required 
to complete a small sequence of frames. 
Co-verification tools in this case would 
either take an impractically long time to 
complete or validate only a mere fraction 
of the software code, falling far short of 
what it takes to find problems in the 
HW/SW interface. 


FPGAs overcome this problem in large 
part by being able to provide access to real 
or near real targets at a very early stage in 
the design cycle. Among other things, this 
means that SW engineers using FPGAs 
can quickly and easily sort out logic and 
design flaws by targeting real silicon. The 
engineers do not have to rely on ineffi- 
cient ASIC-centric techniques like co-ver- 
ification or writing stub code. Application 
software can be debugged at system speeds 
with full hardware and software register 


access and control. 


Additionally, the FPGA fabric allows for 
construction of highly customized and 
value-added cores to enable powerful real- 
time, on-chip debug capability. Some 


examples of such instrumentation include: 
¢ Logic and bus analyzer functions 
¢ Bus protocol compliance monitors 


¢ Memory buffers for debug and trace 
port data 


¢ Cross-domain triggers and breakpoints 
e Hardware run control of the CPU 
¢ HW/SW time synchronization logic. 


Furthermore, a single cable is able to per- 
form multiple functions, like debugging 
hardware, debugging software, as well as 
programming the FPGAs. This greatly 
simplifies the lab setup making it much 


easier to exploit the debug advantages. 


Summary: Platform FPGAs offer a clear- 
er, more cohesive, and overall more effec- 
tive debug strategy. Specifically, Platform 
FPGAs offer up-front silicon access along 
with unprecedented visibility and control 
of the processor and its peripherals resid- 


ing in the programmable fabric. 


#5 — Top Tier Partnerships and 
Vendor Tools Support 


In extending the concept of traditional 
programmable logic to Platform FPGAs, 
certain critical technologies have had to 
be developed or acquired. One of most 
exciting aspects brought forth by FPGA 


vendors has been to successfully forge 
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strategic partnerships with vendors hold- 
ing various system technology compo- 
nents. Driven largely by the successes of 
the FPGA business models, leading ven- 
dors have been eager to partner in fulfill- 
ing the vision of building powerful pro- 


grammable systems platforms. 


Areas of cooperation include partnerships 
in the form of cutting-edge process tech- 
nology, powerful mainstream processing 
elements — such as RISC CPU, high- 
speed I/O — and other components 
deemed of significance to a system design 
platform. IBM, Conexant Systems, Inc., 
Wind River Systems, and other high pro- 
file vendors are currently engaged in such 
strategic partnerships. What this means to 
you is there is no need for these partners 
to negotiate licensing, royalty, and inte- 
gration issues with individual vendors, 
thereby greatly reducing your engineering, 


management, and accounting overhead. 


The appeal and draw of FPGAs has 
caught the attention of independent SW 
tool vendors. Increasing numbers of ven- 
dors are able to sustain business models 
selling to FPGA customers. Several new 
vendors are setting up shop to write cus- 
tom tools to help enhance and exploit the 


unique capabilities of Platform FPGAs. 
Summary: The FPGA business model has 


attracted top-tier silicon and IP vendors to 
forge strategic partnerships to create pow- 
erful system design platforms. Software 
vendors are able to financially justify 
investment in research and development 
leading to a continuous stream of an 


increasing number of innovative solutions. 
#6 — Application Space: Co-Design Flexibility 
HW combined with 


processors on a single chip softens the 
HW/SW design boundary. By using 


hardware-assisted architectural explo- 


Programmable 


ration, designers can optimally search for 
the right HW and SW partitioning, 
which leads to the increased probability 
of being able to meet performance and 
Sequential 


area targets. computing, 


exception handling, and control func- 
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tions, for instance, programmed in HW 
could be implemented in SW running on 


the processor to save silicon. 


On the other hand, SW structures and 
algorithms — which can be broken into 
parallel, non-blocking processes — can 
achieve significant speed and data 
throughput improvements by implement- 
ing them in HW. In fact, software engi- 
neers represent a new and emerging mar- 
ket for Platform FPGAs. Aided by design 
tools, it is now easier than ever for SW 
engineers to explore concepts of software 
acceleration via HW. Both hardware and 
firmware are reprogrammable and field 
upgradeable, which enables the develop- 
ment and deployment of several product 
generations from one base. This translates 
into a much broader applicability than 
ASICs and ASSPs. 


In addition to being suitable for building 
complex embedded applications, HW engi- 
neers can utilize an otherwise idle processor 
to run relevant portions of their design 
algorithms or control logic. As Figure 1 
shows, the CPU can serve as a simple 
microcontroller running a single-threaded 
PLD vendors add value by 


providing software device drivers and 


application. 


library functions for rapid implementation 
without requiring the HW engineer to have 


detailed knowledge of SW practices. 


Summary: The Platform FPGAs provide the 
most flexible co-design platform by enabling 
dynamic HW-SW design partitioning. 


#7 — Risk Management 


When compared to Platform FPGAs, ASIC 
SOCs present a huge design risk. With 
more variables and issues to worry about, 
and with limited debug capabilities 


when mistakes are found, 


ASIC designers are 


forced to either resolve problems subopti- 
mally in software, or in the worst case sce- 
nario, they are forced to respin the silicon at 


great cost and loss of time to market. 


Reprogrammability comes up big here. 
Programmable platforms allow early 
access to silicon. Engineers can validate 
performance and functionality in real sil- 
icon, leading to greater confidence in the 
reliability of the completed system. The 
programmability of the platform allows 
itself to be debugged and upgraded even 
after the system has been deployed. This 
helps promote and protect the time-to- 
market advantage by alleviating a large 


part of the risk. 


Summary: Programmable Platform 
FPGAs provide better control over the 
life cycle management of products by 
minimizing the cost and time penalty for 


design errors and specification changes. 
Conclusion 


In an era when SOCs continue to domi- 
nate mainstream design, Platform FPGAs 
are emerging to take a share of the spot- 
light from ASICs. While ASIC SOCs have 
and will continue to be strong in certain 
segments — like low power, small form fac- 
tor, and high-volume, cost-sensitive con- 
sumer electronic gadgets — an increasing 
range of other infrastructure applications 
in areas such as networking, telecommu- 
nications, industrial electronics, and data 
storage have compelling reasons to move 


to a programmable platform. & 
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2. XILINX ® 


DS090 (v1.0) January 3, 2002 


Features 


¢ Optimized for 1.8V systems 
-  Industry’s fastest low power CPLD 
- Static Icc of less than 100 microamps at all times 
- Densities from 32 to 512 macrocells 
¢ Industry’s best 0.18 micron CMOS CPLD 
- Optimized architecture for effective logic synthesis 
- Multi-voltage I/O operation — 1.5V to 3.3V 
¢ Advanced system features 
- Fastest in system programming 
1.8V ISP using IEEE 1532 (JTAG) interface 
- |EEE1149.1 JTAG Boundary Scan Test 
- Optional Schmitt trigger input (per pin) 
- Unsurpassed low power management 
- FZP 100% CMOS product term generation 
- DataGATE external signal control 
- Flexible clocking modes 
Optional DualEDGE triggered registers 
Clock divider (+ 2,4,6,8,10,12,14,16) 
CoolCLOCK 
- Global signal options with macrocell control 
Multiple global clocks with phase selection per 
macrocell 
Multiple global output enables 
Global set/reset 
- Abundant product term clocks, output enables and 
set/resets 
- Efficient control term clocks, output enables and 
set/resets for each macrocell and shared across 
function blocks 
- Advanced design security 
- Open-drain output option for Wired-OR and LED 
drive 
- Optional bus-hold or weak pullup on selected I/O 
pins 
- Optional configurable grounds on unused |/Os 
- Mixed I/O voltages compatible with 1.5V, 1.8V, 
2.5V, and 3.3V logic levels on all parts 


Table 1: CoolRunner-ll CPLD Family Parameters 


CoolRunner-Il CPLD Family 


Advance Product Specification 


SSTL2-1,SSTL3-1, and HSTL-1 on 128 macro- 
cell and denser devices 
- PLA architecture 
Superior pinout retention 
100% product term routability across function 
block 
- Hot pluggable 
- Wide package availability including fine pitch: 
Chip Scale Package (CSP) BGA, Fine Line 
BGA, TQFP, PQFP, VQFP, and PLCC packages 
- Design entry/verification using Xilinx and industry 
standard CAE tools 
- Free software support for all densities using Xilinx 
WebPACK™ or WebFITTER™ tools 
- Industry leading nonvolatile 0.18 micron CMOS 
process 
- Guaranteed 1,000 program/erase cycles 
- Guaranteed 20 year data retention 


Family Overview 


Xilinx CoolRunner™-I| CPLDs deliver the high speed and 
ease of use associated with the XC9500/XL/XV CPLD fam- 
ily with the extremely low power versatility of the XPLA3™ 
family in a single CPLD. This means that the exact same 
parts can be used for high-speed data communications/ 
computing systems and leading edge portable products, 
with the added benefit of In System Programming. Low 
power consumption and high-speed operation are com- 
bined into a single family that is easy to use and cost effec- 
tive. Xilinx patented Fast Zero Power™ (FZP) architecture 
inherently delivers extremely low power performance with- 
out the need for any special design measures. Clocking 
techniques and other power saving features extend the 
users power budget. The design features are supported 
starting with Xilinx ISE 4.11, WebFITTER, and ISE Web- 
PACK. 


Table 1 shows the macrocell capacity and key timing 
parameters for the CoolRunner-II] CPLD family. 























XC2C32 XC2C64 XC2C128 XC2C256 XC2C384 XC2C512 
Macrocells 32 64 128 256 384 Se 
Max I/O Gre! 64 100 184 240 270 
Tpp (ns) B.8 4.0 4.5 5.0 555 6.0 
Toy (ns) flee 2.0 2.1 2.2 has 2.4 
Teco (ns) 2.8 3.0 3.4 3.8 4.2 4.6 
Foystem (MHz) 303 270 244 222 204 189 
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CoolRunner-Il CPLD Family 


Table 2 shows the CoolRunner-I| CPLD package offering 
with corresponding I/O count. All packages are surface 
mount, with over half of them being ball-grid technologies. 
The ultra tiny packages permit maximum functional capacity 
in the smallest possible area. The CMOS technology used in 
CoolRunner-Il CPLDs generates minimal heat, allowing the 
use of tiny packages during high-speed operation. 
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There are at least two densities present in each package 
with three in the VQ100 (100-pin 1.0mm QFP) and TQ144 
(144-pin 1.4mm QFP), and in the FT256 (256-ball 1.0mm 
spacing FLBGA). The FT256 is particularly important for 
slim dimensioned portable products with mid- to high-den- 
sity logic requirements. 


Table 2: CoolRunner-Il CPLD Family Packages and I/O Count 
































XC2C32 XC2C64 XC2C128 XC2C256 XC2C384 XC2C512 
PC44 33 33 ; : : : 
VQ44 33 33 ; : - : 
CP56 33 45 = : : = 
VQ100 - 64 80 80 > = 
Cialse - = 100 106 - : 
TQ144 - - 100 1 fe 118 = 
PQ208 - - = 17S 173 7s 
mliz56 : - : 184 Pas Ved 212 
FG324 = = - - 240 2/0 


























Table 3 details the distribution of advanced features across 
the CoolRunner-Il CPLD family. The family has uniform 
basic features with advanced features included in densities 
where they are most useful. For example, it is very unlikely 
that four I/O banks are needed on 32 and 64 macrocell 
parts, but very likely they are for 384 and 512 macrocell 
parts. The I/O banks are groupings of I/O pins using any one 


Table 3: CoolRunner-ll CPLD Family Features 


of a subset of compatible voltage standards that share the 
same Vccio level. (See Table 4 for a summary of 
CoolRunner-lIl I/O standards.) The clock division capability 
is less efficient on small parts, but more useful and likely to 
be used on larger ones. DataGATE, an ability to block and 
latch inputs to save power, Is valuable in larger parts, but 
brings marginal benefit to small parts. 















































XC2C32 XC2C64 XC2C128 XC2C256 XC2C384 XC2C512 
Elele 682 / J J J J J 
I/O banks | 1 Z 2 4 4 
Clock division - - Wi J J WA 
Clock doubling J J J J J Wf, 
DataGATE : - J J J J 
LVTTL J J / J / / 
LVCMOS33, 25, 18, and 1.5V I/O J J J J J J 
SSTL2-1 : - J / J J 
SSTL3-1 : : J J J J 
HSTL-1 ; : J J J J 
Configurable ground J J J J J J 
Quadruple data security JV Wf J J 7. J 
Open drain outputs 7. J J J J J 
Hot plugging J J v7. J / J 
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Architecture Description 


CoolRunner-Il CPLD is a highly uniform family of fast, low 
power CPLDs. The underlying architecture is a traditional 
CPLD architecture combining macrocells into Function 
Blocks (FBs) interconnected with a global routing matrix, the 
Xilinx Advanced Interconnect Matrix (AIM). The Function 
Blocks use a Programmable Logic Array (PLA) configuration 
which allows all product tems to be routed and shared 
among any of the macrocells of the FB. Design software can 
efficiently synthesize and optimize logic that is subsequent- 
ly fit to the FBs and connected with the ability to utilize a 
very high percentage of device resources. Design changes 
are easily and automatically managed by the software, 
which exploits the 100% routability of the Programmable 
Logic Array within each FB. This extremely robust building 
block delivers the industry’s highest pinout retention, under 


BSC Path 


CoolRunner-Il CPLD Family 


very broad design conditions. The architecture will be 
explained by expanding the detail as we discuss the 
underlying Function Blocks, logic and interconnect. 


The design software automatically manages these device 
resources so that users can express their designs using 
completely generic constructs without knowledge of these 
architectural details. More advanced users can take 
advantage of these details to more thoroughly understand 
the software's choices and direct its results. 


Figure 1 shows the high-level architecture whereby 
Function Blocks attach to pins and interconnect to each 
other within the internal interconnect matrix. Each FB con- 
tains 16 macrocells. The BSC path is the JIAG Boundary 
Scan Control path. The BSC and ISP block has the JTAG 
controller and In-System Programming Circuits. 


Clock and Control Signals 


Function 
Block 1 
VO Pin KS MC1 
VO Pin KK 3 MC2 
[co] 16 FB 
| eo 
| eo 
oan 
2 | eo 
° Is Pe | PLA 
° | a Te i AIM 
a2 = — 
oa 
| eo 
| eo 
| eo 
I/O Pin KS MC16 
16 Fast Inputs 


Function 
Block n 
MC1 K > 1/0 Pin 
MC2 K > 1/0 Pin 
16 FB |e 
| eo 
[Os] 
|e 
[os] 2 
PLA |e | 8 |\fee 
Ae Som a ;. 
SS cae © 
eeiee| 
| eo 
os 
| eo 
| eo 
MC16 KS /0 Pin 
Fast Inputs 


DS090_01_ 121201 


Figure 1: CoolRunner-Il CPLD Architecture 
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Function Block 


The CoolRunner-I| CPLD Function Blocks contain 16 
macrocells, with 40 entry sites for signals to arrive for logic 
creation and connection. The internal logic engine is a 56 
product term PLA. All Function Blocks, regardless of the 
number contained in the device, are identical. For a high- 
level view of the Function Block, see Figure 2. 


40 Out 
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Global Global 
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Figure 2: CoolRunner-ll CPLD Function Block 


At the high level, it is seen that the product terms (p-terms) 
reside in a programmable logic array (PLA). This structure is 
extremely flexible, and very robust when compared to fixed 
or cascaded product term function blocks. 


Classic CPLDs typically have a few product terms available 
for a high-speed path to a given macrocell. They rely on 
capturing unused p-terms from neighboring macrocells to 
expand their product term tally, when needed. The result of 
this architecture is a variable timing model and the possibil- 
ity of stranding unusable logic within the FB. 


The PLA is different — and better. First, any product term 
can be attached to any OR gate inside the FB macrocell(s). 
Second, any logic function can have as many p-terms as 
needed attached to it within the FB, to an upper limit of 56. 
Third, product terms can be re-used at multiple macrocell 
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OR functions so that within a FB, a particular logical prod- 
uct need only be created once, but can be re-used up to 
16 times within the FB. Naturally, this plays well with the 
fitting software, which identifies product terms that can be 
shared. 


The software places as many of those functions as it can 
into FBs, so it happens for free. There is no need to force 
macrocell functions to be adjacent or any other restriction 
save residing in the same FB, which is handled by the 
software. Functions need not share a common clock, 
common set/reset or common output enable to take full 
advantage of the PLA. Also, every product term arrives 
with the same time delay incurred. There are no cascade 
time adders for putting more product terms in the FB. 
When the FB product term budget is reached, there is a 
small interconnect timing penalty to route signals to anoth- 
er FB to continue creating logic. Xilinx design software 
handles all this automatically. 


Macrocell 


The CoolRunner-ll CPLD macrocell is extremely efficient 
and streamlined for logic creation. Users can develop sum 
of product (SOP) logic expressions that comprise up to 40 
inputs and span 56 product terms within a single function 
block. The macrocell can further combine the SOP expres- 
sion into an XOR gate with another single p-term expres- 
sion. The resulting logic expression’s polarity is also selec- 
table. As well, the logic function can be pure combinatorial 
or registered, with the storage element operating selec- 
tably as a D or T flip-flop, or transparent latch. Available at 
each macrocell are independent selections of global, func- 
tion block level or local p-term derived clocks, sets, resets, 
and output enables. Each macrocell flip-flop is config- 
urable for either single edge or DualEDGE clocking, pro- 
viding either double data rate capability or the ability to dis- 
tribute a slower clock (thereby saving power). For single 
edge clocking or latching, either clock polarity may be 
selected per macrocell. CoolRunner-|l macrocell details 
are shown in Figure 3. Note that in Figure 3, standard logic 
symbols are used except the trapezoidal multiplexers have 
input selection from statically programmed configuration 
select lines (not shown). Xilinx application note XAPP376 
gives a detailed explanation of how logic is created in the 
CoolRunner-Il CPLD family. 


Xcell Journal /7 


> XILINX® 


From AIM 














lo PVA, PWE. (IKE cli 
other macrocells 


Fast Input 
from 


CTC, CTR, /O Block 


Cie Cie 


PTA 
CTS 
GSR 
GND 


PLA OR Term 


GCKO 
GCK1 
GCK2 


CTC 
PTA 
oe CTR 
GSR 
GND 


CoolRunner-Il CPLD Family 


Feedback 
to AIM 


Pe ete 
ptr 


PTC ce UFIF 
L]Latch 


ck UDualEDGE 





R 


DS090_03_ 121201 


Figure 3: CoolRunner-Il CPLD Macrocell 


When configured as a D-type flip-flop, each macrocell has 
an optional clock enable signal permitting state hold while a 
clock runs freely. Note that Control Terms (CT) are available 
to be shared for key functions within the FB, and are gener- 
ally used whenever the exact same logic function would be 
repeatedly created at multiple macrocells. The CT product 
terms are available for FB clocking (CTC), FB asynchronous 
set (CTS), FB asynchronous reset (CTR), and FB output 
enable (CTE). 


Any macrocell flip-flop can be configured as an input regis- 
ter or latch, which takes in the signal from the macrocell’s 
I/O pin, and directly drives the AIM. The macrocell combina- 
tional functionality is retained for use as a buried logic node 
if needed. 


Advanced Interconnect Matrix (AIM) 


The Advanced Interconnect Matrix is a highly connected low 
power rapid switch. The AIM is directed by the software to 
deliver up to a set of 40 signals to each FB for the creation 
of logic. Results from all FB macrocells, as well as, all pin 
inputs circulate back through the AIM for additional connec- 
tion available to all other FBs as dictated by the designsoft- 
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ware. The AIM minimizes both propagation delay and 
power as It makes attachments to the various FBs. 


I/O Block 


I/O blocks are primarily transceivers. However, each I/O is 
either automatically compliant with standard voltage ranges 
or can be programmed to become so. 


In addition to voltage levels, each input can selectively 
arrive through Schmitt-trigger inputs. This adds a small time 
delay, but substantially reduces noise on that input pin. 
Hysteresis also allows easy generation of external clock cir- 
cuits. The Schmitt-trigger path is best seen in Figure 4. 


Outputs can be directly driven, 3-stated or open-drain con- 
figured. A choice of slow or fast slew rate output signal is 
also available. Table 4 summarizes various supported volt- 
age standards associated with specific part capacities. All 
inputs and disabled outputs are voltage tolerant up to 3.3V. 


Figure 4 details the I/O pin, where it is noted that the inputs 
requiring comparison to an external reference voltage 
(SSTL2-1, SSTL3-1, HSTL-1) are available on the 128 
macrocell and denser parts. 
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Figure 4: CoolRunner-ll CPLD I/O Block Diagram 


Table 4 summarizes the single ended |/O standard support 
and shows which standards require VpeF values and board 
termination. VREF detail is given in specific data sheets. 


Table 4: CoolRunner-Il CPLD I/O Standard Summary 





























Input | Board Termination 
I/O Standard | Vecio| Vrer Voltage (Vr7) 
Ly Pie che N/A N/A 
LVCMOS33 3.3 N/A N/A 
LVCMOS25 2) ES) N/A N/A 
LVCMOS18 1.8 N/A N/A 
1.5V I/O 125 N/A N/A 
HSTL-1 eS) ON TAS ON75 
SSTL2-1 2.9 e235 23 
SSTL3-1 Sins 1.5 ies 

















Output Banking 


CPLDs are widely used as voltage interface translators. To 
that end, the output pins are grouped in large banks. The 
smallest parts are not banked, so all signals will have the 
same output swing for 32 and 64 macrocell parts. The 
medium parts (128 and 256 macrocell) support two output 
banks. With two, the outputs will switch to one of two select- 
ed output voltage levels, unless both banks are set to the 
same voltage. The larger parts (884 and 512 macrocell) 
Support four output banks split evenly. They can support 
groupings of one, two, three or four separate output voltage 
levels. This kind of flexibility permits easy interfacing to 
3.3V, 2.5V, 1.8V, and 1.5V in a single part. 
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DataGATE 


Low power is the hallmark of CMOS technology. Other 
CPLD families use a sense amplifier approach to creating 
product terms, which always has a residual current com- 
ponent being drawn. This residual current can be several 
hundred milliamps, making them unusable in portable 
systems. CoolRunner-ll CPLDs use standard CMOS 
methods to create the CPLD architecture and deliver the 
corresponding low current consumption, without doing 
any special tricks. However, sometimes designers would 
like to reduce their system current even more by selec- 
tively disabling circuitry not being used. 


The patented DataGATE technology was developed to per- 
mit a straightforward approach to additional power reduc- 
tion. Each I/O pin has a series switch that can block the 
arrival of free running signals that are not of interest. 
Signals that serve no use may increase power consump- 
tion, and can be disabled. Users are free to do their design, 
then choose sections to participate in the DataGATE func- 
tion. DataGATE is a logic function that drives an assertion 
rail threaded through the medium and _ high-density 
CoolRunner-Il CPLD parts. Designers can select inputs to 
be blocked under the control of the DataGATE function, 
effectively blocking controlled switching signals so they do 
not drive internal chip capacitances. Output signals that do 
not switch, are held by the bus hold feature. Any set of input 
pins can be chosen to participate in the DataGATE function. 
Figure 5 shows the familiar CMOS Icc versus switching 
frequency graph. With DataGATE, designers can approach 
zero power, should they choose to, in their designs 


Figure 6 shows how DataGATE basically works. One I/O 
pin drives the DataGATE Assertion Rail. It can have any 
desired logic function on it. It can be as simple as mapping 
an input pin to the DataGATE function or as complex as a 
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counter or state machine output driving the DataGATE I/O world through a pin, for inspection. If DataGATE is not 
pin through a macrocell. When the DataGATE rail is needed, this pin is an ordinary I/O. 

asserted low, any pass transistor switch attached to it Is 

blocked. Note that each pin has the ability to attach to the 

AIM through a DataGATE pass transistor, and thus be 

blocked. A latch automatically captures the state of the pin 

when it becomes blocked. The DataGATE Assertion Rail 

threads throughout all possible I/Os, so each can partici- 

pate if chosen. Note that one macrocell is singled out to 

drive the rail, and that macrocell is exposed to the outside 


0 Frequency —————_»> 
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Figure 5: CMOS lIc¢¢ vs. Switching Frequency Curve 
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Figure 6: DataGATE Architecture (output drivers not shown) 
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Global Signals 


Global signals, clocks (GCK), sets/resets (GSR) and output 
enables (GTS), are designed to strongly resemble each 
other. This approach enables design software to make the 
best utilization of their capabilities. Each global capability is 
supplemented by a corresponding product term version. 
Figure 7 shows the common structure of the global signal 
trees. The pin input is buffered, then drives multiple internal 
global signal traces to deliver low skew and reduce loading 
delays. The DataGATE assertion rail is also a global signal. 


DS090_07_101001 


Figure 7: Global Clocks (GCK), Sets/Resets (GSR) and 
Output Enables (GTS) 
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Additional Clock Options: Division, 
DualEDGE, and CoolICLOCK 

Division 

Circuitry has been included in the CoolRunner-Il CPLD 
architecture to divide one externally supplied global clock 
by standard values. Division by 2,4,6,8,10, 12, 14 and 16 
are the options (see Figure 8). This capability is supplied 
on the GCkK2 pin. The resulting clock produced will be 
50% duty cycle for all possible divisions. Note that a 
Synchronous Reset is included to guarantee no runt 
clocks can get through to the global clock nets. Note that 
again, the signal is buffered and driven to multiple traces 
with minimal loading and skew. 


DualEDGE 


Each macrocell has the ability to double its input clock 
switching frequency. Figure 9 shows the macrocell flip- 
flop with the DualEDGE option (doubled clock) at each 
macrocell. The source to double can be a control term 
clock, a product term clock or one of the available global 
clocks. The ability to switch on both clock edges is vital 
for a number of synchronous memory interface applica- 
tions as well as certain double data rate I/O applications. 


CoolICLOCK 


In addition to the DualEDGE flip-flop, additional power 
savings can be had by combining the clock division cir- 
cuitry with the DualEDGE circuitry. This capability is 
called CoolICLOCK and is designed to reduce clocking 
power within the CPLD. Because the clock net can be an 
appreciable power drain, the clock power can be reduced 
by driving the net at half frequency, then doubling the 
clock rate using DualEDGE triggering at the macrocells. 
Figure 10 shows how CoolCLOCK is created by internal 
clock cascading with the divider and DualEDGE flip-flop 
working together. 
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Figure 8: Clock Division Circuitry for GCK2 
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Figure 9: Macrocell Clock Chain with DualEDGE Option Shown 
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Figure 10: CoolCLOCK Created by Cascading Clock Divider and DualEDGE Option 


Design Security 


Designs can be secured during programming to prevent 
either accidental overwriting or pattern theft via readback. 
Four independent levels of security are provided on-chip, 
eliminating any electrical or visual detection of configuration 
patterns. These security bits can be reset only by erasing 
the entire device. Additional detail is omitted intentionally. 


Timing Model 


Figure 11 shows the CoolRunner-II CPLD timing model. It 
represents one aspect of the overall architecture from a tim- 
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ing viewpoint. Each little block is a time delay that a signal 
will incur if the signal passes through such a resource. 
Timing reports are created by tallying the incremental sig- 
nal delays as signals progress within the CPLD. Software 
creates the timing reports after a design has been 
mapped onto the specific part, and knows the specific 
delay values for a given speed grade. Equations for the 
higher level timing values (i.e., Tpp and Fsystem) are 
available. Table 5 summarizes the individual parameters 
and provides a brief definition of their associated func- 
tions. Xilinx application note XAPP375 details the 
CoolRunner-I| CPLD family timing with several examples. 


Summer 2002 


CoolRunner-|l CPLD Family 


ie 


D/T 





>< XILINX® 


Si 


TEN 


[ Toem |-—> 


DS090_11_12 


Figure 11: CoolRunner-ll CPLD Timing Model 


Table 5: Timing Parameter Definitions 


Table 5: Timing Parameter Definitions (Continued) 








Symbol 


Buffer Delays 


Parameter 


Symbol 


Parameter 


Macrocell Delays 


















































Tin Input Buffer Delay 

TEIN Fast data register input delay 

Teck Global clock (GCk) buffer delay 

Toesr Global set/reset (GSR) buffer delay 
Tets Global output enable (GTS) buffer delay 
Tout Output buffer delay 

TEN Output buffer enable/disable delay 
Ts_ew Output buffer slew rate control delay 








P-term Delays 


























hey Control Term delay (single PT or FB-CT) 
TLoci Single P-term logic delay 
TLogie Multiple P-term logic delay adder 
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Tpp Macro cell input to output valid 

Tsy Macro register setup before clock 

ry Macro register hold after clock 

Tecsu Macro register enable clock setup time 

TECHO Macro register enable clock hold time 

Toco Macro register clock to output valid 

Tra Macro register set/reset recovery before 
clock 

Tao} Macro register set/reset to output valid 

Tepiv Clock divider delay adder 

Tuys Hysteresis selection delay adder 

Feedback Delays 

le Feedback delay 

ToEM Macrocell to Global OE delay 
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In System Programming 


All CoolRunner-I| CPLD parts are 1.8V in system program- 
mable. This means they derive their programming voltage 
and currents from the 1.8V Vcc (internal supply voltage) 
pins on the part. The Vccio pins do not participate in this 
operation, as they may assume another voltage ranging as 
high as 3.3V down to 1.5V. A 1.8V Vcc is required to prop- 
erly operate the internal state machines and charge pumps 
that reside within the CPLD to do the nonvolatile program- 
ming operations. Xilinx software is provided to deliver the 
bit-stream to the CPLD and drive the appropriate IEEE 1532 
protocol. To that end, there is a set of IEEE 1532 commands 
that are supported in the CoolRunner-Il CPLD parts. 
Programming times are less than one second for 32 to 256 
macrocell parts. Programming times are less than four sec- 
onds for 384 and 512 macrocell parts. 


JTAG Instructions 


Table 6 shows the commands available to users. These 
same commands may be used by third party ATE products, 
as well. The internal controllers can operate as fast as 66 
MHz. The JTAG interface buffers are powered by a dedicat- 
ed power pin, VccAux ; which is independent of all other 


Supply pins. 


Table 6: JTAG Instructions 





























Code Instruction Description 

00000000} EXTEST | Force boundary scan data onto 
outputs 

00000011} PRELOAD | Latch macrocell data into 
boundary scan cells 

11111111) BYPASS | Insert bypass register between 
TDI and TDO 

00000010 INTEST | Force boundary scan data onto 
inputs and feedbacks 

OCCT OONeT. IDCODE Read IDCODE 

11111101 | USERCODE | Read USERCODE 

LA LO HIGHZ Force output into high 
impedance state 

aL LALO CLAMP Latch present output state 














Power-Up Characteristics 


CoolRunner-!] CPLD parts must operate under the demands 
of both the high-speed and the portable market places, 
therefore, they must support hot plugging for the high-speed 
world and tolerate most any power sequence to its various 
voltage pins. They must also not draw excessivecurrent dur- 
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ing power-up initialization. To those ends, the general 
behavior is summarized as follows: 


1. |/O pins are disabled until the end of power-up. 


2. As supply rises, configuration bits transfer from non- 
volatile memory to SRAM cells. 


3. AS power up completes, the outputs become as con- 
figured (input, output, or I/O). 


4. The process takes less than 25 ms and draws less 
than 5 mA of current. 


I/O Banking 


CoolRunner-Il CPLD 32 and 64 macrocell parts support a 
single Vccio rail that can range from 3.3V down to 1.5V 
operation. Two Vccio rails are supported on the 128 and 
256 macrocell parts where outputs on each rail can inde- 
pendently range from 3.3V down to 1.5V operation. Four 
Veclo rails are supported on the 384 and 512 macrocell 
parts with each rail independently supporting any voltage 
between 3.3V and 1.5V. The Vcc (internal supply voltage) 
for a CoolRunner-II| CPLD must be maintained within 1.8V 
+5% for correct soeed operation and proper in system 
programming. 


Mixed Voltage, Power Sequencing, 
and Hot Plugging 


As mentioned in I/O Banking, CoolRunner-Il CPLD parts 
support mixed voltage I/O signals where signals within the 
Same bank can range from 3.3V down to 1.5V. The power 
applied to the Vccjo and Vcc pins can occur in any order 
and the CoolRunner-Il CPLD will not be damaged. 
CoolRunner-I| CPLDs can reside on boards where the 
board is inserted into a “live” connector (hot plugged) and 
the parts will be well-behaved as If powering up in a stan- 
dard way. 


Development System Support 


Xilinx CoolRunner-I] CPLDs are supported by all configura- 
tions of Xilinx standard release development software as 
well as the freely available WebFITTER and ISE WebPACK 
software available from www.xilinx.com. Third party devel- 
opment tools include synthesis tools from Cadence, 
Exemplar, Mentor Graphics, Synplicity, and Synopsys. 


ATE Support 


Third party ATE development support is available for both 
programming and board/chip level testing. Vendors pro- 
viding this support include Hewlett-Packard, GenRad, and 
Teradyne. Other third party providers are expected to 
deliver solutions in the future. 
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Absolute Maximum Ratings") 














Symbol Parameter Min. Max. Unit 
Voc Supply voltage?) relative to GND —0.5 2.0 V 
V, Input voltage?) relative to GND —0.5 4.0 V 
lie) Maximum junction temperature —40 125 ae 
Tstr Storage temperature —65 150 C 























Notes: 

1. Stresses above those listed may cause malfunction or permanent damage to the device. This is a stress rating only. Functional 
operation at these or any other condition above those indicated in the operational and programming specification is not implied. 

2. Thechip supply voltage should rise monotonically. 


3. Maximum DC undershoot below GND must be limited to either 0.5V or 10 mA, whichever is easier to achieve. During transitions, the 
device pins may undershoot to —2.0V or overshoot to 3.5 V, provided this over- or undershoot lasts less than 10 ns and with the 
forcing current being limited to 200 mA. The I/O voltage may never exceed 4.0V. 


Quality and Reliability Parameters 











Symbol Parameter Min Max Units 
Tpr Data retention 20 - Years 
Npe Program/erase cycles (Endurance) 1,000 - Cycles 
Vesp Electrostatic discharge 2,000 - Volts 























Revision History 


The following table shows the revision history for this document. 





Date Version Revision 





01/03/02 eO Initial Xilinx release. 











Summer 2002 Xcell Journal 85 


Xilinx FPGA Product Selection Matrix 


CLB (1 CLB = 4 slices = Max 128 bits) Block SelectRAM 


VIRTEX-II 40x22 x 40x22 3,008 008 


wre oe 
cee eee ee 
216 


7M 





CLB Resources BLK RAM CLK Resources I/O Features 


Virtex-II Family — 1.5 Volt 





































































Reference | FPGAs 








LDT-25, LVPECL-33, cee 0.4 

80K | 168 | 512 | 1,152_| 1,024 | 16k | 8 | 144k | 8 | 24/420 | 4 | DCM| DCM| YES | 60 | 120 |  LVDS-33, LVDS-25, 0.6M 

250K | 24x16 | 1,536 | 3,456 | 3,072 | 48k | 24 | 432K | 24 | 24/420 | 8 | DCM| DCM| YES | 100 | 200 | LVDSEXT-33, LVDSEXT-25, 1.7M 

500K | 32x24 | 3,072 | 6,912 | 6144 | 96K | 32 | 576K | 32 | 24/420 | 8 | DCM|DCM| YES | 132 | 264 | BLVDS-25, ULVDS-25, 2.8M 

iM | 40x32 | 5,120 | 11,520 | 10,240 | 160K | 40 | 720K | 40 | 24/420 | 8 | DCM|DCM| YES | 216 | 432 | — LVTTL, LVCMos33, 4.1M 

15M | 48x40 | 7,680 | 17,280 | 15,360 | 240K | 48 | 864K | 48 | 24/420 | 8 | DCM| DCM| YES | 264 | 528 | LVCMOS25, LVCMOS18, B|5 |_5.7M 

2M | 56x48 | 10,752_| 24,192 | 21,504 | 336K _| 56 | 1008K | 56 | 24/420 | 8 | DCM| DCM | YES | 312 | 624 | LVCMOS15, PCI33, PCI66, 7.5M 

3M | 64x56 | 14,336 | 32,256 | 28,672 | 448K_| 96 | 1728K | 96 | 24/420 | 12 | DCM| DCM | YES | 360 | 720 | PCI-X, GTL, GTL+, HSTL, 10.5M 

912 | HSTL Il, HSTL Ill, HSTL W, 15.7M 

SSTL21, SSTL211, 21.9M 

am | 112x104 | 46,502 | 104,832 | 93,184 | 1456K | 168 | 3024K | 168 | 24/420 | 12 | DCM, DCM | YES | 554 | 1108 | SsTL31,sSTI3I 29.1M 
Virtex-E Family — 1.8 Volt .18um Six Layer Metal Process 

| 72k | 16x24 | 768 | 1,728 | 1,536 | 24K | 16 | 64k | NA | 25/350 | 8 | YES | YES | NA | 88 | 176_| 0.6M 

| 128K | 20x30 | 1,200 | 2,700 | 2,400 | 37.5k | 20 | 80K | NA | 25/350 | 8 | ves | YES | NA | 98 | 196 | 0.9M 

306K | 28x42 | 2,352_| 5,292 | 4,704 | 73.5K | 28 | 112K | NA | 25/350 | 8 | YES | YES | NA | 142 | 284 | — LVTTL, LVCMOS2, 145M 

A12K | 32x48 | 3,072 | 6,912 | 6144 | 96K | 32 | 128K | NA | 25/350 | 8 | YES | YES | NA | 158 | 316 |  LVCMOS18, PCI33, 1.88M 

570K | 40x60 | 4800 | 10,800 | 9,600 | 150K | 40 | 160K | NA | 25/350 | 8 | YES | YES | NA | 202 | 404 |  PCI66, GTL, GTL+, 2.7M 

986K | 48x72 | 6,912 | 15,552 | 13,824 | 216K | 72 | 288K | NA | 25/350 | 8 | YES | YES | NA | 256 | 512 | HSTLI, HSTL II, HSTL Wy, 5 |_3.97M 

1,569K | 64x96 | 12,288 | 27,648 | 24,576 | 384K | 96 | 384K | NA | 25/350 | 8 | YES | YES | NA | 330 | 660 |  SSTL31, SSTL3Il, 6.6M 

-2,188K_| 72x 180 | 25,920 | 34,992 | 51,840 | 486K | 144 | 576K | NA | 25/350 | 8 | YES | YES | NA | 362 | 724 | SSTL21, SSTL211, BLVDS, 8.4M 

-2,542K | 80x120 | 19,200 | 43,200 | 38,400 | 600K | 160 | 640K | NA | 25/350 | 8 | YES | YES | NA | 402 | 804 | LVDS, LVPECL 10.2M 

3,264K | 92x 138 | 25,392 | 57,132_| 50,784 | 793.5K | 184 | 736K _| NA | 25/350 | 8 | YES | YES | NA | 402 | 804 13M 

4074 | 104x156 | 32.448 | 73008 | 64,896 | T014K | 208 | 832K | NA | 25/350 | 8 | YES | YES | NA | 402 | 804 | 16.3M 
Virtex-EM Family — 1.8 Volt -18um Six cen Te Process 

131M | 40x60 | 4800 | 10,800 | 9,600 | 150K | 140 | 560K | NA | 25/350 | 8 | YES | YES | NA | 202 | 404 Same As 67-8 5 3.43M 
Spartan-IIE Family — 1.8 Volt .18um Six <a 1s Br 

| 50K | 16x24 | 768 | 1,728 | 1,536 | 24k | 8 | 32k | NA | 25/320 | 4 | ves | YES | NA | 84 | 182 | LVITLWCMOs2,WOMOsIg | -6- 0.6M 

100K | 20x30 | 1,200 | 2700 | 2,400 | 375K | 10 | 40k | NA | 25/320 | 4 | YES | YES | NA | 86 | 202 | PCIB3,PCI66,GTLGTL+, |_-6- ran 0.9M 

HSTL | HSTL Il, HSTLIV SSTL3 | 11M 

SST13 I SSTL2|SSM12 I, AGP-2K 1.4M 

300k | 32x48 | 3,072 | 6912 | 6144 | 96K | 16 | 64K | NA | 251320 | 4 | YES | YES | NA | 120 | 329 [CTT LVDS, BIDS, IVPECL 19M 

Spartan-Il Family — 2.5 Volt .22/.18um pare ae Metal Process 

| 15K | 8xi2 [192 | 432 | 384 | eK | 4 | 16K _| NA | 25/200 | 4 | ves | ves | NA | NA | 86 | LvTTLivcMos2, |_-5-6 | 5 0.2M 

| 30K | 12x18 | 432, | 972 | 864 | 135K | 6 | 24K | NA | 25/200 | 4 | ves | YES | NA | NA | 132 | PCI33(33VA5V), |_-5-6 0.4 

| 50K | 16x24 | 768 | 1,728 | 1,536 | 24k | 8 | 32K | NA | 25/200 | 4 | Yes | YES | NA | NA | 176 | PCI66(3.3V), GTL,GTL+,|_-5 -6 a, | |_0.6M 

100K | 20x30 | 1,200 | 2700 | 2,400 | 375K | 10 | 40k | NA | 25/200 | 4 | YES | YES | NA | NA | 196 | HSTLI,HSTLII,HSTLIV, |_-5-6 ~ |) 2 | 0.8M 

260_| SSTL3I, SSTL3 Il, SSTL2 |, |_-5 -6 11M 

200K 28 x 42 2352 a2 o2 4,704 735K 14 56K NA | 25/200 | 4 | YES | YES | NA NA 284 SSTL2 Il, AGP-2X, CTT B-6 =) 1.4M 





Note: 1. System Gates include 20-30% of CLBs used as RAM Important: Verify all Data with Device Data Sheet 
2. A Logic Cell is defined as a 4 input LUT and a register 
DCM — Digital Clock Management 
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FPGAS 





Xilinx Spartan FPGAs 


CLB Resources BLK RAM CLK Resources 1/O Features 










Spartan-lIE Family — 1.8 Volt .18um Six reco Metal Process 
50K | 16x24 | 768 | 1,728 | 1,536 | 24K | 8 | 32K | NA | 25/320 | 4 | YES | YES | NA | 84 | 182_| LVTTLLVCMOs2, lvaMost8 0.6M 
100K | 20x30_| 1,200 | 2,700 | 2,400 | 37.5K | 10 | 40K | NA | 25/320 | 4 | YES | YES | NA | 86 | 202 | PCI33,PCI66, GTL,GTL+, 0.9M 
1.1M 

ca 1.4M 

1.9M 

.22/.18um aap a Metal Process 


















Spartan-Il Family — 2.5 Volt 








| 15k | 8xt2_ | 192 | 432 | 384 | 6k | 4 | 16k | NA | 25/200 | 4 | ves | ves | NA | NA | 86 | LVTTL,WvcMOs2, | -5-6 | -5 | 0.2M 
| 30K | 12x18 | 432 | 972 | 864 | 135k | 6 | 24k | NA| 25/200 | 4 | ves | YES | NA | NA | 132 | PCB3@3vVa5V), | 5-6 0.4M 
50K | 16x24 | 768 | 1,728 | 1,536 | 24K | 8 | 32K | NA | 25/200 | 4 | YES | YES | NA | NA | 176 | PCI66(3.3V),GTL,GTL+,| -5 -6 & | & |_0.6M 
196 | HSTL|, HSTLI, HSTLIV, | -5 -6 | | 08M 
260__| SSTL3 |, SSTL3 Il SSTLZI, |_ -5 -6 11M 
200K | 28x42 2,352 | 5,292 | 4,704 | 735K | 14 | 56K | NA | 25/200 | 4 | YES | YES | NA | NA | 284 | SSTL2II,AGP-2X,CIT | -5-6 | -5 1.4M 


PACKAGE OPTIONS AND USER I/O 


: Note: 1. System Gates include 20-30% of CLBs used as RAM 
SPARTANCIIE SPARTAN? III 
VA 


2. A Logic Cell is defined as a 4 input LUT and a register 
DCM -— Digital Clock Management 
Important: Verify all Data with Device Data Sheet 


Numbers indicated in the matrix are the maximum number of user |/O’s for 
that package and device combination. 


vO's 182/202 |263 |289| 329 IM 86 1132 | 176/196 | 260 | 294 





PQFP Packages (PQ) 














208 146 |146 146 146 | 146 132 | 140 | 140| 140 | 140 
VQFP Packages (VQ) 

PT) Bolo) 
TQFP Packages (TQ) 

Emm oem | saan | 

Chip Scale Packages — wire-bond chip-scale BGA (0.8 mm ball spacing) 

PT Tf] asi) Ty 
FGA Packages (FT) — wire-bond fine-pitch thin BGA (1.0 mm ball spacing) 
Re 77mm 182 |182| 182 182, 182 aa 
FGA ttt (FG) — wire-bond fine-pitch BGA (1.0 mm ball spacing) 





176} 176] 176 | 176 
202 |. 263) | 269; 329 196 | 260 | 284 
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Xilinx FPGA Package Options and User 1/0 


User Available I/Os 


1 140 
Ss 
as 
—— 








.00 CT 


MISTER. lime) ico sian 
i035 
S100 Fase | 





SPARTAN-II 


Yo EX-E SPARTAN-IIE 





Yr 














22 





PQFP Packages (PQ) 
146 | 146 | 146 | 146 | 146 132} 140 140 | 140 | 140 
158 | 158 | 158 | 158 | 158 | 158} 158 
HQFP Packages (HQ) 
| ea) a | | | | 
144 rrr Beef 6 | jor joo] | 
BGA Packages (BG) — wire-bond standard BGA (1.27 mm ball spacing) 


208 
| 240 rr 58} | | fase} Pt 
Chip Scale Packages — wire-bond chip-scale BGA (0.8 mm ball spacing) 
ae 

Eile SEaie ir 

SRE 

||| [328 |392}aos] || 

ze 





eA 88 |120 | 200 |264 | 432 |528 | 624 | 720 | 912 | 1104 | 1296 
240 
VQFP Packages (VQ) 
is 53 en aa oF | esel Can Md Eel cand dN cn) 2m ead (Ga al Oem Nine 20a fa 
TQFP Packages (TQ) 
| 144 SA SE a a a 
FGA Packages (FT) — wire-bond fine-pitch thin BGA (1.0 mm ball spacing) 
56 ooo 
FGA Packages (FG) — wire-bond fine-pitch BGA (1.0 mm ball spacing) 


176 |176) 176) 176 176) 176 | 176) 176 
P [200 calaza] [|| Pe eee 


| | | | 392 ]456|4ea] | a 
ff f5t2}512|st2j5i2) || 
| | | | [660/660 }660) | | 
| | | | |st2}660|700} | || [556 


660 | 724 | 804 | 804) 804 








FFA Packages (FF) — flip-chip fine-pitch BGA (1.0 mm ball spacing) 
896 432 528 624 Note: Virtex-Il packages FG456 and FG676 are footprint compatible. 


1152 oe 824 Virtex-Il packages FF896 and FF1152 are footprint compatible. 


1517 912 | 1104! 1108 Important: Verify all Data with Device Data Sheet 


BFA Packages (BF) — flip-chip fine-pitch BGA (1.27 mm ball spacing) 
P| || 684 68 ox 








Numbers indicated in the matrix are the maximum number of user 
I/O's for that package and device combination. 





88 Xcell Journal Summer 2002 


CPLDs 





Xilinx CPLD Product Selection Matrix 


PRODUCT SELECTION MATRIX 


CoolRunner-Il Family — 1.8 Volt 


15/18/2533 | 1.5/1.8/2.53.3 
1.5/1.8/2.5/3.3} 1.5/1.8/2.533.3 
15/1.8/2.533.3 100. 


1.5/1.8/2.5/3.3 | 184 





ee) 






Ww | Ww 
WoL eGo 


1.5/1.8/2.5/3.3 
1.5/1.8/2.5/3.3 
CoolRunner XPLA3 Family — 3.3 Volt 
ee 
‘oe 
; 
XC9500XV Family — 2.5 Volt 
an [36/90/2583 | 192583 : 
1600 72) 90) 2.5/3.3 1.8/2.5/3.3 - Cool nner =f | | 
‘azo [tas] 90) 253 | 1312883 a (unner"! 
eso 98] 90 | 2503 | 18050. 
XC9500XL Family — 3.3 Volt 
a alan eaeeca eee - PACKAGE OPTIONS AND USER I/O 
ji |r 50| 25035 | 25097 
‘azo [144] 90| 25835 | 2503 (117 
6400 | 288} 90 | 2.5/3.3/5 a3: 


PLCC Packages (PC) 


a4 BRREEE DORR BER Bee 


PQFP Packages (PQ) 


2 ST fests TD edit Tiss Ths 


VQFP Packages (VQ) 

a 9 EE Ee 

64 aa aoe ag easy eee 36 | 52| 
sa]ea] | Realty 


c | eo|o0) 
Sei no 


TQFP Packages (TQ) 
ee eae 
z fio ia} tia) | fine 20 jr hii 
ne 

























34 a: 


72 [81 | 
a7 fn 


























































2 
ue mac 
2 Be 
= SA 


Chip Scale Packages (CS) — wire- [ue o 7 BGA (0.8 mm ball spacing) 

Te fis 
BGA Packages (BG) — wire-bond standard BGA (1.27 mm ball spacing) 
FBGA Packages (FG) — wire-bond Fineline BGA (1.0 mm ball spacing) 

PL fe Ty pe 


Chip Scale Packages (CP) — wire-bond chip-scale BGA (0.5 mm ball spacing) 
56 22 oe ee 
132 ES 
E tit islet T atet | alt 
144 Lee 
at 
256 EEREEREE EEREREERE EERE BE 
FGA Packages (FT) — wire-bond fine-pitch thin BGA (1.0 mm ball spacing) 
256 EES See 
324 240 270 220} 260 









*JTAG pins and port enable are not pin compatible in this package for this member of the family. 
Important: Verify all Data with Device Data Sheet and Product Availability with your local Xilinx Rep 
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Reference | Development Tools _| Iolo) 


Xilinx Software 


Device Support 


Design Planning 


Design Entry 


Synthesis 


Implementation Tools 


Board Level Integration 


Verification 


IP/CORE 

















No 


Synplicity Synplify/Pro Integration Kit (PC Only) Integration Kit (PC Only) 


Leonardo Spectrum Integration Kit Integration Kit Integration Kit Integration Kit 


Synopsys FPGA Compiler II EDIF Only EDIF Only EDIF Only EDIF Only 
ABEL CPLD CELD CPLD (PC Only) No 


=< 
(a>) 
n 


Xilinx Synthesis Technology (XST) 


a 


Feature ISE WebPACK I) i =¢- Ly), @ ISE Foundation ISE Alliance 
Plato re ee ee eee PCIUNI 
CoolRunner XPLA3 / CoolRunner™-I! Families es ee ee ALL 
Educational Services Yes ee Yes 
Schematic Editor (Gate & Block Level HDL) Yes eo No 
State Diagram Editor es No 
System Generator for DSP Sold as an Option 
a 
Integration Kit (PC Only) 
Integration Kit 
EIFOnly 
CPLD (PC Only) 
eed 


Ze pa | oz! |e || Io 
ws | 


* HSPICE Models available at the Xilinx Design Tools Center at www.xilinx.com/ise. 


KKK 


iMPACT Yes Ss Yes Yes 

Xilinx Constraints Editor Yes SS =. Yes 

Timing Driven Place & Route Yes = Yes 

LMG SmartModels Yes a Yes 

ModelSim Xilinx Edition (MXE II) ModelSim XE II Starter* ** 
3rd Party Simulator Support Yes = a Yes 

For more information on the complete list of Xilinx IP products, visit the Xilinx IP Center at http://www.xilinx.com/ipcenter 

* For Spartan, SpartanXL, and 4K device families, ISE supports the implementation by reading in gate-level EDIF netlist. 
MXE II supports the simulation of designs up to 1 million system gates and is sold as an option. 
Summer 2002 


90 Xcell Journal 





Configuration 





Xilinx Contiguration Storage Solutions 








Multiple Designs 
Non-Volatile Media 





cc) 
(2)] 
oO 
Som 
(e) 
~~ 
VY) 
(<}) 
Som 
= 
e 
(o) 
WY 


Min board space 


a Compression 





()) 
2 
© 
5 
= 
()) 
ce 
25cm Unlimited | Yes Yes | 30 Mbit/sec CompactFlash 


12.25 cm’| Yes SelectMAP (up to 4 FPGA) Up to 8 No | Yes | 152 Mbit/sec | AMD Flash Memory 
Slave-Serial (up to 8 FPGA chains) 





System ACE CF up to 8 Gbit 


Ea IRL Hooks 


7 Max Config. Speed 


CHAM @ eile mm Cerets 





— 


16 Mbit 
32 Mbit 
64 Mbit 


System ACE MPM 





H Memory Density 
PN Number of Components 





System ACE SC 16 Mbit | 3 | Custom | Yes SelectIMAP (up to 4 FPGA) Up to 8 No | No | Yes | 152 Mbit/sec | AMD Flash memory 
32 Mbit Slave-Serial (up to 8 FPGA chains) 
64 Mbit 


OTP Configuration PROMs for Spartan-lI/IIE 


exes yt nav Yt 
Das yr ir) |) aviv 
Haste fy) || iaaviv 
hese yfr) | iy aviv 
Pres) |) | yay 
ess irr) |) aaviv 


Nea: ‘ols: x 

Paw | fev pawl f Paso [yfyiy} || aviv) 

aw aay Coase [Vivi |) aay 
One-Time Programmable (OTP) Configuration PROMs Rese y 

| 16mb | fyfyfy] | [3avfy|y | xcasiso |y | y| 33v|¥ | Y 

awe | || lyfe laavfy caer Loe 

paw [EL fe iy iy paavi yf 

coe | visi 





Xilinx Home Page Xilinx Education Center 

http://www.xilinx.com http://www.xilinx.com/support/education-home.htm 

Xilinx Online Support Xilinx Tutorial Center 
http://www.xilinx.com/support/support.htm http://www.xilinx.com/support/techsup/tutorials/index.htm 
Xilinx IP Center Xilinx WebPACK 

http://www.xilinx.com/ipcenter/index.htm http://www.xilinx.com/sxpresso/webpack.htm 
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Reference 








Xilinx Global Services 


Xilinx Global Services 


Xilinx Design Services Part Number Product Description Duration | Availability 





Educational Services 


e Provide extensive FPGA hardware and 


embedded software design experience FPGA13000-4-ILT Fundamentals of FPGA Design (v4.x) 8 Now 
backed by industry recognized experts and FPGA16000-4-ILT ISE Design Entry (v4.x) — Instructor-Led course hee Now 
resources to solve even the most complex 
design challenge. FPGA23000-4-ILT Designing for Performance (v4.x) Now 
‘ System Architecture Consulting Be Pravide LANG1100-3-ILT VHDL — Introduction to VHDL (v3.x) — Instructor-Led Course Now 
engineering services to define system architecture LANG11000-4-ILT VHDL — Introduction to VHDL (v4.x) — Instructor-Led Course November 
and partitioning for design specification. 
LANG12000-4-ILT VERILOG — Introduction to Verilog (v4.x) — Instructor-Led Course Now 
: a= : : 
Custom Design Solutions — Project designed, PCI1800-3-ILT PCI CORE Basics (v3.x) — Instructor-Led Course 8 | Now 
verified, and delivered to mutually agreed 
upon design specifications. PCI18000-4-ILT PCI CORE Basics (v4.x) — Instructor-Led Course ea December 
e IP Core Development, Optimization, Integration, PCI2800-3-ILT PCI — Designing a PCI System (v3.x) — Instructor-Led Course Now 
Modification, and Verification —3 Modify, PC128000-4-ILT PCI — Designing a PCI System (v4.x) — Instructor-Led Course December 
integrate, and optimize customer intellectual — 
property or chiral party cores to wotlk nek PC|2900-3-ILT PCI-X — Designing for PCI-X (v3.x) Now 
Xilinx technology. Develop customer- PCI129000-4-ILT PCI-X — Designing for PCI-X (v4.x) December 
required special features to Xilinx We eee ASIC1500-3-|LT FPGA Design for the ASIC User (v3.x) — Instructor-Led Course Now 
or third party cores. Perform integration, 
optimization, and verification of IP cores ASIC25000-4-ILT FPGA Design for the ASIC User (v4.x) — Instructor-Led Course November 
in Xilinx technology. DSP2000-3-ILT DSP Implementation Techniques for Xilinx FPGAs November 
° Embedded Software — Develop complex PROMO-5002-4-LEL E-Series Il (v4) October 


embedded software with real-time constraints, = 
using hardware/software co-design tech- TE TCR One Training Credit ey 
niques. TC-0S-1DY One Day of On-Site Training = Now 
¢ Conversions — Convert ASIC designs and TC-OS-2DY Two days of On-Site Training Now 
other FPGAs to Xilinx technology and TC-OS-3DY Three days of On-Site Training Now 
devices. 
TC-OS-4DY Four days of On-Site Training Now 


aan : : . TC-OS-5DY Five days of On-Site Training 40 Now 
Xilinx Platinum Technical Services 
Platinum Technical Services 


e Senior Applications Engineers SC-PLAT-SVC-10 1 Seat Platinum Technical Service w/10 education credits Now 


¢ Dedicated Toll Free Number* SC-PLAT-SITE-50 Platinum Technical Service site license up to 50 customers Naeeeeral Now 
¢ Priority Case Resolution SC-PLAT-SITE-100 Platinum Technical Service site license for 51-100 customers iy Now 





e Proactive Status Updates SC-PLAT-SITE-150 Platinum Technical Service site license for 101-150 customers Now 
¢ Ten Education Credits Design Services 
e Electronic Newsletter DC-DES-SERV | Design Services Contract | | 


¢ Formal Escalation Process 
¢ Service Packs and Software Updates 


® Application Engineer to Customer Ratio, 


2x Gold Level 


* Available in US only 


Call Education Services Call Design Services Call Technical Services 

North America: 877-XLX-CLASS (877-959-2527) North America & Asia: North America: 800-255-7778 or 408-879-5199 
http://support.xilinx.com/support/training/training.htm Richard Foder: 408-626-4256 http://support.xilinx.com 

Europe: +44-870-7350-548 Europe: United Kingdom: +44-870-7350-610 
eurotraining@xilinx.com Alex Hillier: +44-870-7350-516 eurosupport@xilinx.com 

Japan: +81-03-5321-7750 | Martina Finnerty: +353-1-4032469 Japan: +81-03-5321-7750 
http://support.xilinx.co.jp/support/education-home.htm jhotline@xilinx.com 

Asia Pacific: http://support.xilinx.com/support/education-home.htm Other Locations: 


http://support.xilinx.com/support/services/contact_info.htm 
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For digital video, 
Spartan-IlE is the 
clear choice. 








The new Spartan-IIE FPGAs deliver over 20 billion The @SP Web Portal ... a Unique Designer’s Resource 

M ACs¢/s, giving you the DSP performance required The Xilinx ESP web portal gives you all the support you could wish for 

for high-resolution digital images. Together with as standards continue to rapidly change. 

advanced signaling standards including LVDS, Simply log on to xilinx.com/esp and ci FMERUYMT 5 TANARIS 

| HSTL and SSTL, you've got the speed you need you'll get immediate access to reference 

for digital video processing. boards, system block diagrams, a full spectrum of technology & stan- 
From Pictureto Pixel, the Complete Solution dards tutorials, and much more. You'll soon see why ESP is one of the 
Through capture, process and display, Spartan-IIE FPGAs bring the total most successful websites of its kind. 


solution into focus. Encryption IP, operating at high speed 
video rates, offers a new level of security protecting every pixel. 
And the desktop programmability of Spartan-1IE FPGAs 
solves your time-to-market pressures as design cycles get shorter. 


Visit www.xilinx.com/spartan2e/dvt today and find out 
why Spartan-I1E FPGAs are the clear choice for digital 
video design. 
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>. XILINX’ 


The Programmable Logic Company™ 


www. xilinx.com/spartan2e/dvt 
100 BEST COMPANIES TO woRK FOR 


®2001 Xilinx, Inc., 2100 Logic Drive, San Jose, CA 95124. Europe +44-870- 7350-600; Japan +81-3-5321-7711; Asia Pacific +852-2- 424-5200; Xilinx and Spartan are registered trademarks, WebPACK ia a trademark and The Programmable Logic Company is a service mark of Xilinx, Inc. 
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Up to 16 Embedded Rocket 1/0" — 
Mult-Gigabit Transcetvers 


Up to 8 Digital Cloc! Managers ae 


Awith XCITE Technology 


Up to 216 Multipliers 


VY 


ee . Available now, the new 

Virtex-I] Pro” Platform FPGAs 
herald an astonishing breakthrough in 
system-level solutions. With up to four IBM PowerPC” 405 
processors immersed into the industry's leading FPGA fabric, 
Xilinx/Conexant’s flawless high-speed serial 1/O technology, and 
Wind River System’s cutting-edge embedded design tools, Xilinx 
delivers a complete development platform of infinite possibilities. 
The era of the programmable system is here. 





THE POWER OF XTREME PROCESSING 

Each IBM PowerPC runs at 300+ MHz and 420 Dhrystone 
MIPS, and is supported by IBM CoreConnect” bus technology. 
In addition, the FPGA fabric enables technology 
ultra-fast hardware processing, such Neer 

as TeraM ACs/s DSP applications. And Xilinx’s unique |P- 
Immersion” architecture gives you the power of high-perform- 
ance processors, and easy integration of soft IP. A new perform- 
ance standard now enables hardware accelerated processing and 
multiple processing in a single off-the shelf device. 





THE ULTIMATE CONNECTIVITY PLATFORM 

The first programmable device to combine embedded processors 
along with Rocket I/O” 3.125 Gbps transceivers, the Virtex-I| Pro 
series supports 10 Gigabit Ethernet with XAUI, 3GIO, SerialATA, 
InfiniBand, you name it. And our Selectl/O“- Ultra supports 840 

M bps LVDS and other parallel methodologies. With 40+Gbps serial 
bandwidth, market challenges become market solutions. 

With so many features on a single device such as microproces- 
sors, multi-gigabit transceivers, highest density on-chip memory and 
on-chip termination, the benefits include simplified board layout, a 
reduced bill of materials, and unbeatable time to market. 


INDUSTRY-LEADING TOOLS FROM WIND RIVER AND XILINX 
Wind River's enbedded tools are the prenier support for real-time 
microprocessor and logic designs. And Xilinx’s lightning-fast 
[SE 4.21 software provides the most comprehensive, ~ 
easy-to-use development system available. se ¢ 
See the new Virtex-I1 Pro Platform FPGA in 
detail. Visit www.xilinx.com/virtex2pro today and 
step into the era of the programmable system. 
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